Overview

Brought to you by YData

Dataset statistics

Number of variables223
Number of observations1926389
Missing cells303438452
Missing cells (%)70.6%
Total size in memory3.2 GiB
Average record size in memory1.7 KiB

Variable types

Numeric25
Unsupported105
Text90
Boolean3

Dataset

DescriptionInvertebrate Zoology NMNH Extant Specimen Records
CreatorBen Norton
AuthorBen Norton
URLhttps://doi.org/10.15468/dl.fya67r

Alerts

license has constant value "CC0_1_0" Constant
publisher has constant value "National Museum of Natural History, Smithsonian Institution" Constant
institutionID has constant value "urn:lsid:biocol.org:col:34871" Constant
collectionID has constant value "urn:uuid:f14c21a9-8cbf-4c8b-817f-d19d427e2dd6" Constant
institutionCode has constant value "USNM" Constant
collectionCode has constant value "IZ" Constant
datasetName has constant value "NMNH Extant Biology" Constant
materialSampleID has constant value "NORTH_AMERICA" Constant
eventID has constant value "North Pacific Ocean, Gulf Of California" Constant
samplingEffort has constant value "24.1667" Constant
fieldNotes has constant value "-110.283" Constant
georeferenceSources has constant value "PARATYPE" Constant
acceptedNameUsage has constant value "SPECIES" Constant
namePublishedIn has constant value "ACCEPTED" Constant
subgenus has constant value "False" Constant
cultivarEpithet has constant value "108.0" Constant
subgenusKey has constant value "NE" Constant
protocol has constant value "EML" Constant
lastCrawled has constant value "2024-12-02T11:48:23.416Z" Constant
publishedByGbifRegion has constant value "NORTH_AMERICA" Constant
isSequenced is highly imbalanced (97.3%) Imbalance
accessRights has 1926389 (100.0%) missing values Missing
bibliographicCitation has 1926389 (100.0%) missing values Missing
language has 1926389 (100.0%) missing values Missing
references has 1926389 (100.0%) missing values Missing
rightsHolder has 1926389 (100.0%) missing values Missing
type has 1926389 (100.0%) missing values Missing
datasetID has 1926389 (100.0%) missing values Missing
ownerInstitutionCode has 1926389 (100.0%) missing values Missing
informationWithheld has 1926389 (100.0%) missing values Missing
dataGeneralizations has 1926389 (100.0%) missing values Missing
dynamicProperties has 1926389 (100.0%) missing values Missing
recordNumber has 1804636 (93.7%) missing values Missing
recordedBy has 764111 (39.7%) missing values Missing
recordedByID has 1926389 (100.0%) missing values Missing
organismQuantity has 1926389 (100.0%) missing values Missing
organismQuantityType has 1926389 (100.0%) missing values Missing
sex has 1802976 (93.6%) missing values Missing
lifeStage has 1888852 (98.1%) missing values Missing
reproductiveCondition has 1926389 (100.0%) missing values Missing
caste has 1926389 (100.0%) missing values Missing
behavior has 1926389 (100.0%) missing values Missing
vitality has 1926389 (100.0%) missing values Missing
establishmentMeans has 1926389 (100.0%) missing values Missing
degreeOfEstablishment has 1926389 (100.0%) missing values Missing
pathway has 1926389 (100.0%) missing values Missing
georeferenceVerificationStatus has 1926389 (100.0%) missing values Missing
disposition has 1926387 (> 99.9%) missing values Missing
associatedOccurrences has 1926387 (> 99.9%) missing values Missing
associatedReferences has 1926387 (> 99.9%) missing values Missing
associatedSequences has 1921265 (99.7%) missing values Missing
associatedTaxa has 1926387 (> 99.9%) missing values Missing
otherCatalogNumbers has 1926389 (100.0%) missing values Missing
occurrenceRemarks has 1144481 (59.4%) missing values Missing
organismID has 1926389 (100.0%) missing values Missing
organismName has 1926389 (100.0%) missing values Missing
organismScope has 1926389 (100.0%) missing values Missing
associatedOrganisms has 1926389 (100.0%) missing values Missing
previousIdentifications has 1926389 (100.0%) missing values Missing
organismRemarks has 1926389 (100.0%) missing values Missing
materialEntityID has 1926389 (100.0%) missing values Missing
materialEntityRemarks has 1926389 (100.0%) missing values Missing
verbatimLabel has 1926387 (> 99.9%) missing values Missing
materialSampleID has 1926387 (> 99.9%) missing values Missing
eventID has 1926388 (> 99.9%) missing values Missing
parentEventID has 1926389 (100.0%) missing values Missing
eventType has 1926389 (100.0%) missing values Missing
fieldNumber has 1339757 (69.5%) missing values Missing
eventDate has 688611 (35.7%) missing values Missing
eventTime has 1926389 (100.0%) missing values Missing
startDayOfYear has 842312 (43.7%) missing values Missing
endDayOfYear has 842310 (43.7%) missing values Missing
year has 689273 (35.8%) missing values Missing
month has 800939 (41.6%) missing values Missing
day has 887052 (46.0%) missing values Missing
verbatimEventDate has 1173196 (60.9%) missing values Missing
habitat has 1857134 (96.4%) missing values Missing
samplingProtocol has 1926389 (100.0%) missing values Missing
sampleSizeValue has 1926389 (100.0%) missing values Missing
sampleSizeUnit has 1926389 (100.0%) missing values Missing
samplingEffort has 1926388 (> 99.9%) missing values Missing
fieldNotes has 1926388 (> 99.9%) missing values Missing
eventRemarks has 1926389 (100.0%) missing values Missing
locationID has 984063 (51.1%) missing values Missing
higherGeographyID has 1926389 (100.0%) missing values Missing
higherGeography has 67830 (3.5%) missing values Missing
continent has 1027390 (53.3%) missing values Missing
waterBody has 666647 (34.6%) missing values Missing
islandGroup has 1925619 (> 99.9%) missing values Missing
island has 1925411 (99.9%) missing values Missing
countryCode has 110758 (5.7%) missing values Missing
stateProvince has 943672 (49.0%) missing values Missing
county has 1786419 (92.7%) missing values Missing
municipality has 1926389 (100.0%) missing values Missing
locality has 642385 (33.3%) missing values Missing
verbatimLocality has 1926389 (100.0%) missing values Missing
verbatimElevation has 1925927 (> 99.9%) missing values Missing
verticalDatum has 1926389 (100.0%) missing values Missing
verbatimDepth has 1900145 (98.6%) missing values Missing
minimumDistanceAboveSurfaceInMeters has 1926389 (100.0%) missing values Missing
maximumDistanceAboveSurfaceInMeters has 1926389 (100.0%) missing values Missing
locationAccordingTo has 1926389 (100.0%) missing values Missing
locationRemarks has 1926389 (100.0%) missing values Missing
decimalLatitude has 927342 (48.1%) missing values Missing
decimalLongitude has 927342 (48.1%) missing values Missing
coordinateUncertaintyInMeters has 1926389 (100.0%) missing values Missing
coordinatePrecision has 1926389 (100.0%) missing values Missing
pointRadiusSpatialFit has 1926389 (100.0%) missing values Missing
verbatimCoordinateSystem has 1246881 (64.7%) missing values Missing
verbatimSRS has 1926389 (100.0%) missing values Missing
footprintWKT has 1926389 (100.0%) missing values Missing
footprintSRS has 1926389 (100.0%) missing values Missing
footprintSpatialFit has 1926389 (100.0%) missing values Missing
georeferencedBy has 1926389 (100.0%) missing values Missing
georeferencedDate has 1926389 (100.0%) missing values Missing
georeferenceProtocol has 1265789 (65.7%) missing values Missing
georeferenceSources has 1926387 (> 99.9%) missing values Missing
georeferenceRemarks has 1896101 (98.4%) missing values Missing
geologicalContextID has 1926389 (100.0%) missing values Missing
earliestEonOrLowestEonothem has 1926389 (100.0%) missing values Missing
latestEonOrHighestEonothem has 1926389 (100.0%) missing values Missing
earliestEraOrLowestErathem has 1926389 (100.0%) missing values Missing
latestEraOrHighestErathem has 1926389 (100.0%) missing values Missing
earliestPeriodOrLowestSystem has 1926389 (100.0%) missing values Missing
latestPeriodOrHighestSystem has 1926389 (100.0%) missing values Missing
earliestEpochOrLowestSeries has 1926387 (> 99.9%) missing values Missing
latestEpochOrHighestSeries has 1926389 (100.0%) missing values Missing
earliestAgeOrLowestStage has 1926389 (100.0%) missing values Missing
latestAgeOrHighestStage has 1926389 (100.0%) missing values Missing
lowestBiostratigraphicZone has 1926389 (100.0%) missing values Missing
highestBiostratigraphicZone has 1926389 (100.0%) missing values Missing
lithostratigraphicTerms has 1926387 (> 99.9%) missing values Missing
group has 1926389 (100.0%) missing values Missing
formation has 1926389 (100.0%) missing values Missing
member has 1926389 (100.0%) missing values Missing
bed has 1926389 (100.0%) missing values Missing
identificationID has 1926389 (100.0%) missing values Missing
verbatimIdentification has 1926389 (100.0%) missing values Missing
identificationQualifier has 1908256 (99.1%) missing values Missing
typeStatus has 1841062 (95.6%) missing values Missing
identifiedBy has 1085204 (56.3%) missing values Missing
identifiedByID has 1926387 (> 99.9%) missing values Missing
dateIdentified has 1926387 (> 99.9%) missing values Missing
identificationReferences has 1926389 (100.0%) missing values Missing
identificationVerificationStatus has 1926387 (> 99.9%) missing values Missing
identificationRemarks has 1926389 (100.0%) missing values Missing
taxonID has 1926389 (100.0%) missing values Missing
scientificNameID has 1926389 (100.0%) missing values Missing
parentNameUsageID has 1926387 (> 99.9%) missing values Missing
originalNameUsageID has 1926389 (100.0%) missing values Missing
nameAccordingToID has 1926389 (100.0%) missing values Missing
namePublishedInID has 1926387 (> 99.9%) missing values Missing
taxonConceptID has 1926389 (100.0%) missing values Missing
acceptedNameUsage has 1926387 (> 99.9%) missing values Missing
parentNameUsage has 1926389 (100.0%) missing values Missing
originalNameUsage has 1926389 (100.0%) missing values Missing
nameAccordingTo has 1926389 (100.0%) missing values Missing
namePublishedIn has 1926387 (> 99.9%) missing values Missing
namePublishedInYear has 1926389 (100.0%) missing values Missing
class has 66153 (3.4%) missing values Missing
order has 329533 (17.1%) missing values Missing
superfamily has 1926389 (100.0%) missing values Missing
family has 144484 (7.5%) missing values Missing
subfamily has 1926389 (100.0%) missing values Missing
tribe has 1926389 (100.0%) missing values Missing
subtribe has 1926387 (> 99.9%) missing values Missing
genus has 358040 (18.6%) missing values Missing
genericName has 358039 (18.6%) missing values Missing
subgenus has 1926387 (> 99.9%) missing values Missing
infragenericEpithet has 1926387 (> 99.9%) missing values Missing
specificEpithet has 626794 (32.5%) missing values Missing
infraspecificEpithet has 1890285 (98.1%) missing values Missing
cultivarEpithet has 1926387 (> 99.9%) missing values Missing
verbatimTaxonRank has 1926387 (> 99.9%) missing values Missing
vernacularName has 1926387 (> 99.9%) missing values Missing
nomenclaturalCode has 1926387 (> 99.9%) missing values Missing
nomenclaturalStatus has 1926387 (> 99.9%) missing values Missing
taxonRemarks has 1926387 (> 99.9%) missing values Missing
elevation has 1919566 (99.6%) missing values Missing
elevationAccuracy has 1922884 (99.8%) missing values Missing
depth has 1143678 (59.4%) missing values Missing
depthAccuracy has 1205336 (62.6%) missing values Missing
distanceFromCentroidInMeters has 1917542 (99.5%) missing values Missing
mediaType has 1683237 (87.4%) missing values Missing
classKey has 66154 (3.4%) missing values Missing
orderKey has 329532 (17.1%) missing values Missing
familyKey has 144484 (7.5%) missing values Missing
genusKey has 358040 (18.6%) missing values Missing
subgenusKey has 1926387 (> 99.9%) missing values Missing
speciesKey has 626818 (32.5%) missing values Missing
species has 626818 (32.5%) missing values Missing
verbatimScientificName has 353771 (18.4%) missing values Missing
typifiedName has 1926389 (100.0%) missing values Missing
repatriated has 110140 (5.7%) missing values Missing
relativeOrganismQuantity has 1926389 (100.0%) missing values Missing
projectId has 1926389 (100.0%) missing values Missing
gbifRegion has 115674 (6.0%) missing values Missing
level0Gid has 1691066 (87.8%) missing values Missing
level0Name has 1691066 (87.8%) missing values Missing
level1Gid has 1694634 (88.0%) missing values Missing
level1Name has 1694634 (88.0%) missing values Missing
level2Gid has 1708980 (88.7%) missing values Missing
level2Name has 1709046 (88.7%) missing values Missing
level3Gid has 1886622 (97.9%) missing values Missing
level3Name has 1887342 (98.0%) missing values Missing
iucnRedListCategory has 469562 (24.4%) missing values Missing
individualCount is highly skewed (γ1 = 100.3827769) Skewed
gbifID has unique values Unique
occurrenceID has unique values Unique
accessRights is an unsupported type, check if it needs cleaning or further analysis Unsupported
bibliographicCitation is an unsupported type, check if it needs cleaning or further analysis Unsupported
language is an unsupported type, check if it needs cleaning or further analysis Unsupported
references is an unsupported type, check if it needs cleaning or further analysis Unsupported
rightsHolder is an unsupported type, check if it needs cleaning or further analysis Unsupported
type is an unsupported type, check if it needs cleaning or further analysis Unsupported
datasetID is an unsupported type, check if it needs cleaning or further analysis Unsupported
ownerInstitutionCode is an unsupported type, check if it needs cleaning or further analysis Unsupported
informationWithheld is an unsupported type, check if it needs cleaning or further analysis Unsupported
dataGeneralizations is an unsupported type, check if it needs cleaning or further analysis Unsupported
dynamicProperties is an unsupported type, check if it needs cleaning or further analysis Unsupported
recordedByID is an unsupported type, check if it needs cleaning or further analysis Unsupported
organismQuantity is an unsupported type, check if it needs cleaning or further analysis Unsupported
organismQuantityType is an unsupported type, check if it needs cleaning or further analysis Unsupported
reproductiveCondition is an unsupported type, check if it needs cleaning or further analysis Unsupported
caste is an unsupported type, check if it needs cleaning or further analysis Unsupported
behavior is an unsupported type, check if it needs cleaning or further analysis Unsupported
vitality is an unsupported type, check if it needs cleaning or further analysis Unsupported
establishmentMeans is an unsupported type, check if it needs cleaning or further analysis Unsupported
degreeOfEstablishment is an unsupported type, check if it needs cleaning or further analysis Unsupported
pathway is an unsupported type, check if it needs cleaning or further analysis Unsupported
georeferenceVerificationStatus is an unsupported type, check if it needs cleaning or further analysis Unsupported
otherCatalogNumbers is an unsupported type, check if it needs cleaning or further analysis Unsupported
organismID is an unsupported type, check if it needs cleaning or further analysis Unsupported
organismName is an unsupported type, check if it needs cleaning or further analysis Unsupported
organismScope is an unsupported type, check if it needs cleaning or further analysis Unsupported
associatedOrganisms is an unsupported type, check if it needs cleaning or further analysis Unsupported
previousIdentifications is an unsupported type, check if it needs cleaning or further analysis Unsupported
organismRemarks is an unsupported type, check if it needs cleaning or further analysis Unsupported
materialEntityID is an unsupported type, check if it needs cleaning or further analysis Unsupported
materialEntityRemarks is an unsupported type, check if it needs cleaning or further analysis Unsupported
parentEventID is an unsupported type, check if it needs cleaning or further analysis Unsupported
eventType is an unsupported type, check if it needs cleaning or further analysis Unsupported
eventTime is an unsupported type, check if it needs cleaning or further analysis Unsupported
endDayOfYear is an unsupported type, check if it needs cleaning or further analysis Unsupported
samplingProtocol is an unsupported type, check if it needs cleaning or further analysis Unsupported
sampleSizeValue is an unsupported type, check if it needs cleaning or further analysis Unsupported
sampleSizeUnit is an unsupported type, check if it needs cleaning or further analysis Unsupported
eventRemarks is an unsupported type, check if it needs cleaning or further analysis Unsupported
higherGeographyID is an unsupported type, check if it needs cleaning or further analysis Unsupported
municipality is an unsupported type, check if it needs cleaning or further analysis Unsupported
verbatimLocality is an unsupported type, check if it needs cleaning or further analysis Unsupported
verbatimElevation is an unsupported type, check if it needs cleaning or further analysis Unsupported
verticalDatum is an unsupported type, check if it needs cleaning or further analysis Unsupported
minimumDistanceAboveSurfaceInMeters is an unsupported type, check if it needs cleaning or further analysis Unsupported
maximumDistanceAboveSurfaceInMeters is an unsupported type, check if it needs cleaning or further analysis Unsupported
locationAccordingTo is an unsupported type, check if it needs cleaning or further analysis Unsupported
locationRemarks is an unsupported type, check if it needs cleaning or further analysis Unsupported
coordinateUncertaintyInMeters is an unsupported type, check if it needs cleaning or further analysis Unsupported
coordinatePrecision is an unsupported type, check if it needs cleaning or further analysis Unsupported
pointRadiusSpatialFit is an unsupported type, check if it needs cleaning or further analysis Unsupported
verbatimSRS is an unsupported type, check if it needs cleaning or further analysis Unsupported
footprintWKT is an unsupported type, check if it needs cleaning or further analysis Unsupported
footprintSRS is an unsupported type, check if it needs cleaning or further analysis Unsupported
footprintSpatialFit is an unsupported type, check if it needs cleaning or further analysis Unsupported
georeferencedBy is an unsupported type, check if it needs cleaning or further analysis Unsupported
georeferencedDate is an unsupported type, check if it needs cleaning or further analysis Unsupported
geologicalContextID is an unsupported type, check if it needs cleaning or further analysis Unsupported
earliestEonOrLowestEonothem is an unsupported type, check if it needs cleaning or further analysis Unsupported
latestEonOrHighestEonothem is an unsupported type, check if it needs cleaning or further analysis Unsupported
earliestEraOrLowestErathem is an unsupported type, check if it needs cleaning or further analysis Unsupported
latestEraOrHighestErathem is an unsupported type, check if it needs cleaning or further analysis Unsupported
earliestPeriodOrLowestSystem is an unsupported type, check if it needs cleaning or further analysis Unsupported
latestPeriodOrHighestSystem is an unsupported type, check if it needs cleaning or further analysis Unsupported
latestEpochOrHighestSeries is an unsupported type, check if it needs cleaning or further analysis Unsupported
earliestAgeOrLowestStage is an unsupported type, check if it needs cleaning or further analysis Unsupported
latestAgeOrHighestStage is an unsupported type, check if it needs cleaning or further analysis Unsupported
lowestBiostratigraphicZone is an unsupported type, check if it needs cleaning or further analysis Unsupported
highestBiostratigraphicZone is an unsupported type, check if it needs cleaning or further analysis Unsupported
group is an unsupported type, check if it needs cleaning or further analysis Unsupported
formation is an unsupported type, check if it needs cleaning or further analysis Unsupported
member is an unsupported type, check if it needs cleaning or further analysis Unsupported
bed is an unsupported type, check if it needs cleaning or further analysis Unsupported
identificationID is an unsupported type, check if it needs cleaning or further analysis Unsupported
verbatimIdentification is an unsupported type, check if it needs cleaning or further analysis Unsupported
identificationReferences is an unsupported type, check if it needs cleaning or further analysis Unsupported
identificationRemarks is an unsupported type, check if it needs cleaning or further analysis Unsupported
taxonID is an unsupported type, check if it needs cleaning or further analysis Unsupported
scientificNameID is an unsupported type, check if it needs cleaning or further analysis Unsupported
acceptedNameUsageID is an unsupported type, check if it needs cleaning or further analysis Unsupported
originalNameUsageID is an unsupported type, check if it needs cleaning or further analysis Unsupported
nameAccordingToID is an unsupported type, check if it needs cleaning or further analysis Unsupported
taxonConceptID is an unsupported type, check if it needs cleaning or further analysis Unsupported
parentNameUsage is an unsupported type, check if it needs cleaning or further analysis Unsupported
originalNameUsage is an unsupported type, check if it needs cleaning or further analysis Unsupported
nameAccordingTo is an unsupported type, check if it needs cleaning or further analysis Unsupported
namePublishedInYear is an unsupported type, check if it needs cleaning or further analysis Unsupported
superfamily is an unsupported type, check if it needs cleaning or further analysis Unsupported
subfamily is an unsupported type, check if it needs cleaning or further analysis Unsupported
tribe is an unsupported type, check if it needs cleaning or further analysis Unsupported
elevation is an unsupported type, check if it needs cleaning or further analysis Unsupported
elevationAccuracy is an unsupported type, check if it needs cleaning or further analysis Unsupported
depth is an unsupported type, check if it needs cleaning or further analysis Unsupported
depthAccuracy is an unsupported type, check if it needs cleaning or further analysis Unsupported
hasCoordinate is an unsupported type, check if it needs cleaning or further analysis Unsupported
hasGeospatialIssues is an unsupported type, check if it needs cleaning or further analysis Unsupported
taxonKey is an unsupported type, check if it needs cleaning or further analysis Unsupported
acceptedTaxonKey is an unsupported type, check if it needs cleaning or further analysis Unsupported
kingdomKey is an unsupported type, check if it needs cleaning or further analysis Unsupported
phylumKey is an unsupported type, check if it needs cleaning or further analysis Unsupported
classKey is an unsupported type, check if it needs cleaning or further analysis Unsupported
orderKey is an unsupported type, check if it needs cleaning or further analysis Unsupported
typifiedName is an unsupported type, check if it needs cleaning or further analysis Unsupported
relativeOrganismQuantity is an unsupported type, check if it needs cleaning or further analysis Unsupported
projectId is an unsupported type, check if it needs cleaning or further analysis Unsupported

Reproduction

Analysis started2025-01-02 23:28:25.266397
Analysis finished2025-01-02 23:29:39.258926
Duration1 minute and 13.99 seconds
Software versionydata-profiling vv4.12.1
Download configurationconfig.json

Variables

gbifID
Real number (ℝ)

Unique 

Distinct1926389
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1515386844
Minimum1317202449
Maximum4987328269
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size14.7 MiB
2025-01-02T18:29:39.368711image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Quantile statistics

Minimum1317202449
5-th percentile1317561912
Q11318996412
median1320788619
Q31322584565
95-th percentile2571403060
Maximum4987328269
Range3670125820
Interquartile range (IQR)3588153

Descriptive statistics

Standard deviation569122690.2
Coefficient of variation (CV)0.3755626442
Kurtosis14.89183286
Mean1515386844
Median Absolute Deviation (MAD)1794060
Skewness3.742982963
Sum2.919224548 × 1015
Variance3.239006365 × 1017
MonotonicityNot monotonic
2025-01-02T18:29:39.442107image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1318343233 1
 
< 0.1%
1321728981 1
 
< 0.1%
1320179422 1
 
< 0.1%
1320179575 1
 
< 0.1%
1321729723 1
 
< 0.1%
1318339663 1
 
< 0.1%
2235823268 1
 
< 0.1%
1318338841 1
 
< 0.1%
1319862563 1
 
< 0.1%
1675901278 1
 
< 0.1%
Other values (1926379) 1926379
> 99.9%
ValueCountFrequency (%)
1317202449 1
< 0.1%
1317202455 1
< 0.1%
1317202456 1
< 0.1%
1317202459 1
< 0.1%
1317202460 1
< 0.1%
ValueCountFrequency (%)
4987328269 1
< 0.1%
4987328266 1
< 0.1%
4987328256 1
< 0.1%
4987328247 1
< 0.1%
4987328207 1
< 0.1%

accessRights
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

bibliographicCitation
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

language
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

license
Text

Constant 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size14.7 MiB
2025-01-02T18:29:39.505178image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length7
Median length7
Mean length7
Min length7

Characters and Unicode

Total characters13484723
Distinct characters4
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowCC0_1_0
2nd rowCC0_1_0
3rd rowCC0_1_0
4th rowCC0_1_0
5th rowCC0_1_0
ValueCountFrequency (%)
cc0_1_0 1926389
100.0%
2025-01-02T18:29:39.615264image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
C 3852778
28.6%
0 3852778
28.6%
_ 3852778
28.6%
1 1926389
14.3%

Most occurring categories

ValueCountFrequency (%)
(unknown) 13484723
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
C 3852778
28.6%
0 3852778
28.6%
_ 3852778
28.6%
1 1926389
14.3%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 13484723
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
C 3852778
28.6%
0 3852778
28.6%
_ 3852778
28.6%
1 1926389
14.3%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 13484723
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
C 3852778
28.6%
0 3852778
28.6%
_ 3852778
28.6%
1 1926389
14.3%
Distinct113487
Distinct (%)5.9%
Missing0
Missing (%)0.0%
Memory size14.7 MiB
2025-01-02T18:29:39.728143image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length20
Median length20
Mean length20
Min length20

Characters and Unicode

Total characters38527780
Distinct characters14
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique62369 ?
Unique (%)3.2%

Sample

1st row2021-10-06T15:29:00Z
2nd row2024-09-25T16:08:00Z
3rd row2020-01-06T17:42:00Z
4th row2018-09-17T12:46:00Z
5th row2024-09-25T15:32:00Z
ValueCountFrequency (%)
2024-09-25t13:44:00z 9049
 
0.5%
2024-09-25t13:46:00z 8728
 
0.5%
2024-09-25t17:07:00z 8646
 
0.4%
2024-09-25t17:10:00z 8633
 
0.4%
2024-09-25t17:05:00z 8623
 
0.4%
2024-09-25t13:45:00z 8553
 
0.4%
2024-09-25t17:11:00z 8500
 
0.4%
2024-09-25t17:08:00z 8494
 
0.4%
2024-09-25t15:27:00z 8472
 
0.4%
2024-09-25t17:15:00z 8471
 
0.4%
Other values (113477) 1840220
95.5%
2025-01-02T18:29:39.896877image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 8971391
23.3%
2 4988496
12.9%
1 4688757
12.2%
- 3852778
10.0%
: 3852778
10.0%
T 1926389
 
5.0%
Z 1926389
 
5.0%
4 1757735
 
4.6%
5 1702085
 
4.4%
9 1536985
 
4.0%
Other values (4) 3323997
 
8.6%

Most occurring categories

ValueCountFrequency (%)
(unknown) 38527780
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
0 8971391
23.3%
2 4988496
12.9%
1 4688757
12.2%
- 3852778
10.0%
: 3852778
10.0%
T 1926389
 
5.0%
Z 1926389
 
5.0%
4 1757735
 
4.6%
5 1702085
 
4.4%
9 1536985
 
4.0%
Other values (4) 3323997
 
8.6%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 38527780
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
0 8971391
23.3%
2 4988496
12.9%
1 4688757
12.2%
- 3852778
10.0%
: 3852778
10.0%
T 1926389
 
5.0%
Z 1926389
 
5.0%
4 1757735
 
4.6%
5 1702085
 
4.4%
9 1536985
 
4.0%
Other values (4) 3323997
 
8.6%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 38527780
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
0 8971391
23.3%
2 4988496
12.9%
1 4688757
12.2%
- 3852778
10.0%
: 3852778
10.0%
T 1926389
 
5.0%
Z 1926389
 
5.0%
4 1757735
 
4.6%
5 1702085
 
4.4%
9 1536985
 
4.0%
Other values (4) 3323997
 
8.6%

publisher
Text

Constant 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size14.7 MiB
2025-01-02T18:29:39.974615image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length59
Median length59
Mean length59
Min length59

Characters and Unicode

Total characters113656951
Distinct characters21
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNational Museum of Natural History, Smithsonian Institution
2nd rowNational Museum of Natural History, Smithsonian Institution
3rd rowNational Museum of Natural History, Smithsonian Institution
4th rowNational Museum of Natural History, Smithsonian Institution
5th rowNational Museum of Natural History, Smithsonian Institution
ValueCountFrequency (%)
national 1926389
14.3%
museum 1926389
14.3%
of 1926389
14.3%
natural 1926389
14.3%
history 1926389
14.3%
smithsonian 1926389
14.3%
institution 1926389
14.3%
2025-01-02T18:29:40.133877image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
t 13484723
11.9%
i 11558334
10.2%
11558334
10.2%
o 9631945
 
8.5%
a 9631945
 
8.5%
n 9631945
 
8.5%
s 7705556
 
6.8%
u 7705556
 
6.8%
N 3852778
 
3.4%
m 3852778
 
3.4%
Other values (11) 25043057
22.0%

Most occurring categories

ValueCountFrequency (%)
(unknown) 113656951
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
t 13484723
11.9%
i 11558334
10.2%
11558334
10.2%
o 9631945
 
8.5%
a 9631945
 
8.5%
n 9631945
 
8.5%
s 7705556
 
6.8%
u 7705556
 
6.8%
N 3852778
 
3.4%
m 3852778
 
3.4%
Other values (11) 25043057
22.0%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 113656951
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
t 13484723
11.9%
i 11558334
10.2%
11558334
10.2%
o 9631945
 
8.5%
a 9631945
 
8.5%
n 9631945
 
8.5%
s 7705556
 
6.8%
u 7705556
 
6.8%
N 3852778
 
3.4%
m 3852778
 
3.4%
Other values (11) 25043057
22.0%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 113656951
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
t 13484723
11.9%
i 11558334
10.2%
11558334
10.2%
o 9631945
 
8.5%
a 9631945
 
8.5%
n 9631945
 
8.5%
s 7705556
 
6.8%
u 7705556
 
6.8%
N 3852778
 
3.4%
m 3852778
 
3.4%
Other values (11) 25043057
22.0%

references
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

rightsHolder
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

type
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

institutionID
Text

Constant 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size14.7 MiB
2025-01-02T18:29:40.203638image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length29
Median length29
Mean length29
Min length29

Characters and Unicode

Total characters55865281
Distinct characters18
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowurn:lsid:biocol.org:col:34871
2nd rowurn:lsid:biocol.org:col:34871
3rd rowurn:lsid:biocol.org:col:34871
4th rowurn:lsid:biocol.org:col:34871
5th rowurn:lsid:biocol.org:col:34871
ValueCountFrequency (%)
urn:lsid:biocol.org:col:34871 1926389
100.0%
2025-01-02T18:29:40.324143image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
o 7705556
13.8%
: 7705556
13.8%
l 5779167
 
10.3%
r 3852778
 
6.9%
c 3852778
 
6.9%
i 3852778
 
6.9%
u 1926389
 
3.4%
s 1926389
 
3.4%
d 1926389
 
3.4%
n 1926389
 
3.4%
Other values (8) 15411112
27.6%

Most occurring categories

ValueCountFrequency (%)
(unknown) 55865281
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
o 7705556
13.8%
: 7705556
13.8%
l 5779167
 
10.3%
r 3852778
 
6.9%
c 3852778
 
6.9%
i 3852778
 
6.9%
u 1926389
 
3.4%
s 1926389
 
3.4%
d 1926389
 
3.4%
n 1926389
 
3.4%
Other values (8) 15411112
27.6%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 55865281
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
o 7705556
13.8%
: 7705556
13.8%
l 5779167
 
10.3%
r 3852778
 
6.9%
c 3852778
 
6.9%
i 3852778
 
6.9%
u 1926389
 
3.4%
s 1926389
 
3.4%
d 1926389
 
3.4%
n 1926389
 
3.4%
Other values (8) 15411112
27.6%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 55865281
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
o 7705556
13.8%
: 7705556
13.8%
l 5779167
 
10.3%
r 3852778
 
6.9%
c 3852778
 
6.9%
i 3852778
 
6.9%
u 1926389
 
3.4%
s 1926389
 
3.4%
d 1926389
 
3.4%
n 1926389
 
3.4%
Other values (8) 15411112
27.6%

collectionID
Text

Constant 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size14.7 MiB
2025-01-02T18:29:40.399331image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length45
Median length45
Mean length45
Min length45

Characters and Unicode

Total characters86687505
Distinct characters19
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowurn:uuid:f14c21a9-8cbf-4c8b-817f-d19d427e2dd6
2nd rowurn:uuid:f14c21a9-8cbf-4c8b-817f-d19d427e2dd6
3rd rowurn:uuid:f14c21a9-8cbf-4c8b-817f-d19d427e2dd6
4th rowurn:uuid:f14c21a9-8cbf-4c8b-817f-d19d427e2dd6
5th rowurn:uuid:f14c21a9-8cbf-4c8b-817f-d19d427e2dd6
ValueCountFrequency (%)
urn:uuid:f14c21a9-8cbf-4c8b-817f-d19d427e2dd6 1926389
100.0%
2025-01-02T18:29:40.526361image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
d 9631945
11.1%
1 7705556
 
8.9%
- 7705556
 
8.9%
c 5779167
 
6.7%
2 5779167
 
6.7%
u 5779167
 
6.7%
4 5779167
 
6.7%
8 5779167
 
6.7%
f 5779167
 
6.7%
7 3852778
 
4.4%
Other values (9) 23116668
26.7%

Most occurring categories

ValueCountFrequency (%)
(unknown) 86687505
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
d 9631945
11.1%
1 7705556
 
8.9%
- 7705556
 
8.9%
c 5779167
 
6.7%
2 5779167
 
6.7%
u 5779167
 
6.7%
4 5779167
 
6.7%
8 5779167
 
6.7%
f 5779167
 
6.7%
7 3852778
 
4.4%
Other values (9) 23116668
26.7%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 86687505
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
d 9631945
11.1%
1 7705556
 
8.9%
- 7705556
 
8.9%
c 5779167
 
6.7%
2 5779167
 
6.7%
u 5779167
 
6.7%
4 5779167
 
6.7%
8 5779167
 
6.7%
f 5779167
 
6.7%
7 3852778
 
4.4%
Other values (9) 23116668
26.7%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 86687505
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
d 9631945
11.1%
1 7705556
 
8.9%
- 7705556
 
8.9%
c 5779167
 
6.7%
2 5779167
 
6.7%
u 5779167
 
6.7%
4 5779167
 
6.7%
8 5779167
 
6.7%
f 5779167
 
6.7%
7 3852778
 
4.4%
Other values (9) 23116668
26.7%

datasetID
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

institutionCode
Text

Constant 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size14.7 MiB
2025-01-02T18:29:40.572484image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length4
Median length4
Mean length4
Min length4

Characters and Unicode

Total characters7705556
Distinct characters4
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowUSNM
2nd rowUSNM
3rd rowUSNM
4th rowUSNM
5th rowUSNM
ValueCountFrequency (%)
usnm 1926389
100.0%
2025-01-02T18:29:40.669597image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
U 1926389
25.0%
S 1926389
25.0%
N 1926389
25.0%
M 1926389
25.0%

Most occurring categories

ValueCountFrequency (%)
(unknown) 7705556
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
U 1926389
25.0%
S 1926389
25.0%
N 1926389
25.0%
M 1926389
25.0%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 7705556
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
U 1926389
25.0%
S 1926389
25.0%
N 1926389
25.0%
M 1926389
25.0%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 7705556
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
U 1926389
25.0%
S 1926389
25.0%
N 1926389
25.0%
M 1926389
25.0%

collectionCode
Text

Constant 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size14.7 MiB
2025-01-02T18:29:40.704682image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length2
Median length2
Mean length2
Min length2

Characters and Unicode

Total characters3852778
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowIZ
2nd rowIZ
3rd rowIZ
4th rowIZ
5th rowIZ
ValueCountFrequency (%)
iz 1926389
100.0%
2025-01-02T18:29:40.790699image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
I 1926389
50.0%
Z 1926389
50.0%

Most occurring categories

ValueCountFrequency (%)
(unknown) 3852778
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
I 1926389
50.0%
Z 1926389
50.0%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 3852778
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
I 1926389
50.0%
Z 1926389
50.0%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 3852778
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
I 1926389
50.0%
Z 1926389
50.0%

datasetName
Text

Constant 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size14.7 MiB
2025-01-02T18:29:40.853653image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length19
Median length19
Mean length19
Min length19

Characters and Unicode

Total characters36601391
Distinct characters15
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNMNH Extant Biology
2nd rowNMNH Extant Biology
3rd rowNMNH Extant Biology
4th rowNMNH Extant Biology
5th rowNMNH Extant Biology
ValueCountFrequency (%)
nmnh 1926389
33.3%
extant 1926389
33.3%
biology 1926389
33.3%
2025-01-02T18:29:40.992800image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
N 3852778
 
10.5%
t 3852778
 
10.5%
3852778
 
10.5%
o 3852778
 
10.5%
H 1926389
 
5.3%
E 1926389
 
5.3%
M 1926389
 
5.3%
x 1926389
 
5.3%
a 1926389
 
5.3%
B 1926389
 
5.3%
Other values (5) 9631945
26.3%

Most occurring categories

ValueCountFrequency (%)
(unknown) 36601391
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
N 3852778
 
10.5%
t 3852778
 
10.5%
3852778
 
10.5%
o 3852778
 
10.5%
H 1926389
 
5.3%
E 1926389
 
5.3%
M 1926389
 
5.3%
x 1926389
 
5.3%
a 1926389
 
5.3%
B 1926389
 
5.3%
Other values (5) 9631945
26.3%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 36601391
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
N 3852778
 
10.5%
t 3852778
 
10.5%
3852778
 
10.5%
o 3852778
 
10.5%
H 1926389
 
5.3%
E 1926389
 
5.3%
M 1926389
 
5.3%
x 1926389
 
5.3%
a 1926389
 
5.3%
B 1926389
 
5.3%
Other values (5) 9631945
26.3%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 36601391
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
N 3852778
 
10.5%
t 3852778
 
10.5%
3852778
 
10.5%
o 3852778
 
10.5%
H 1926389
 
5.3%
E 1926389
 
5.3%
M 1926389
 
5.3%
x 1926389
 
5.3%
a 1926389
 
5.3%
B 1926389
 
5.3%
Other values (5) 9631945
26.3%

ownerInstitutionCode
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB
Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size14.7 MiB
2025-01-02T18:29:41.064042image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length19
Median length18
Mean length18.00144052
Min length17

Characters and Unicode

Total characters34677777
Distinct characters17
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowPRESERVED_SPECIMEN
2nd rowPRESERVED_SPECIMEN
3rd rowPRESERVED_SPECIMEN
4th rowPRESERVED_SPECIMEN
5th rowPRESERVED_SPECIMEN
ValueCountFrequency (%)
preserved_specimen 1922252
99.8%
machine_observation 3456
 
0.2%
human_observation 681
 
< 0.1%
2025-01-02T18:29:41.194170image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
E 9618853
27.7%
S 3848641
11.1%
R 3848641
11.1%
P 3844504
 
11.1%
N 1930526
 
5.6%
I 1929845
 
5.6%
V 1926389
 
5.6%
M 1926389
 
5.6%
_ 1926389
 
5.6%
C 1925708
 
5.6%
Other values (7) 1951892
 
5.6%

Most occurring categories

ValueCountFrequency (%)
(unknown) 34677777
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
E 9618853
27.7%
S 3848641
11.1%
R 3848641
11.1%
P 3844504
 
11.1%
N 1930526
 
5.6%
I 1929845
 
5.6%
V 1926389
 
5.6%
M 1926389
 
5.6%
_ 1926389
 
5.6%
C 1925708
 
5.6%
Other values (7) 1951892
 
5.6%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 34677777
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
E 9618853
27.7%
S 3848641
11.1%
R 3848641
11.1%
P 3844504
 
11.1%
N 1930526
 
5.6%
I 1929845
 
5.6%
V 1926389
 
5.6%
M 1926389
 
5.6%
_ 1926389
 
5.6%
C 1925708
 
5.6%
Other values (7) 1951892
 
5.6%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 34677777
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
E 9618853
27.7%
S 3848641
11.1%
R 3848641
11.1%
P 3844504
 
11.1%
N 1930526
 
5.6%
I 1929845
 
5.6%
V 1926389
 
5.6%
M 1926389
 
5.6%
_ 1926389
 
5.6%
C 1925708
 
5.6%
Other values (7) 1951892
 
5.6%

informationWithheld
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

dataGeneralizations
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

dynamicProperties
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

occurrenceID
Text

Unique 

Distinct1926389
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size14.7 MiB
2025-01-02T18:29:42.227632image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length63
Median length63
Mean length63
Min length63

Characters and Unicode

Total characters121362507
Distinct characters26
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1926389 ?
Unique (%)100.0%

Sample

1st rowhttp://n2t.net/ark:/65665/3c831e8df-8799-47a1-8dcf-bcb0b77fd3e3
2nd rowhttp://n2t.net/ark:/65665/383ab647e-23a7-4086-b71e-36212ccc0eb2
3rd rowhttp://n2t.net/ark:/65665/383adbf6e-f769-4dc3-8bef-550530af49ee
4th rowhttp://n2t.net/ark:/65665/3c83aad38-c935-46fa-96c3-e450ebb169cf
5th rowhttp://n2t.net/ark:/65665/383b126a6-bf3a-4908-bc33-e4435555fcc5
ValueCountFrequency (%)
http://n2t.net/ark:/65665/3c843fd56-7874-4858-b938-14fdfcb5544c 1
 
< 0.1%
http://n2t.net/ark:/65665/33275786b-f1fe-4add-972f-33ff5c507828 1
 
< 0.1%
http://n2t.net/ark:/65665/3c831e8df-8799-47a1-8dcf-bcb0b77fd3e3 1
 
< 0.1%
http://n2t.net/ark:/65665/383ab647e-23a7-4086-b71e-36212ccc0eb2 1
 
< 0.1%
http://n2t.net/ark:/65665/383adbf6e-f769-4dc3-8bef-550530af49ee 1
 
< 0.1%
http://n2t.net/ark:/65665/375bb0af4-5d38-4cd6-b8a0-08c73948a463 1
 
< 0.1%
http://n2t.net/ark:/65665/332517de1-2fda-4aa0-b70d-aa8e1a04dc45 1
 
< 0.1%
http://n2t.net/ark:/65665/332534e23-cc82-471b-bb9a-2a5c6c99975c 1
 
< 0.1%
http://n2t.net/ark:/65665/375c2a833-b1da-4c95-81a0-59bf9ffc8e2a 1
 
< 0.1%
http://n2t.net/ark:/65665/375c9b917-c771-448b-8a68-0719d4403f61 1
 
< 0.1%
Other values (1926379) 1926379
> 99.9%
2025-01-02T18:29:43.498872image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
/ 9631945
 
7.9%
6 9394374
 
7.7%
- 7705556
 
6.3%
t 7705556
 
6.3%
5 7461163
 
6.1%
a 6018592
 
5.0%
3 5539460
 
4.6%
e 5537684
 
4.6%
2 5537375
 
4.6%
4 5534532
 
4.6%
Other values (16) 51296270
42.3%

Most occurring categories

ValueCountFrequency (%)
(unknown) 121362507
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
/ 9631945
 
7.9%
6 9394374
 
7.7%
- 7705556
 
6.3%
t 7705556
 
6.3%
5 7461163
 
6.1%
a 6018592
 
5.0%
3 5539460
 
4.6%
e 5537684
 
4.6%
2 5537375
 
4.6%
4 5534532
 
4.6%
Other values (16) 51296270
42.3%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 121362507
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
/ 9631945
 
7.9%
6 9394374
 
7.7%
- 7705556
 
6.3%
t 7705556
 
6.3%
5 7461163
 
6.1%
a 6018592
 
5.0%
3 5539460
 
4.6%
e 5537684
 
4.6%
2 5537375
 
4.6%
4 5534532
 
4.6%
Other values (16) 51296270
42.3%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 121362507
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
/ 9631945
 
7.9%
6 9394374
 
7.7%
- 7705556
 
6.3%
t 7705556
 
6.3%
5 7461163
 
6.1%
a 6018592
 
5.0%
3 5539460
 
4.6%
e 5537684
 
4.6%
2 5537375
 
4.6%
4 5534532
 
4.6%
Other values (16) 51296270
42.3%
Distinct1355389
Distinct (%)70.4%
Missing5
Missing (%)< 0.1%
Memory size14.7 MiB
2025-01-02T18:29:44.283476image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length16
Median length11
Mean length11.0374022
Min length6

Characters and Unicode

Total characters21262275
Distinct characters63
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1024472 ?
Unique (%)53.2%

Sample

1st rowUSNM 1119015
2nd rowUSNM 55168
3rd rowUSNM 52536
4th rowUSNM E40844
5th rowUSNM 241160
ValueCountFrequency (%)
usnm 1926384
50.0%
31
 
< 0.1%
284908 16
 
< 0.1%
653324 13
 
< 0.1%
5357 11
 
< 0.1%
859036 10
 
< 0.1%
224878 10
 
< 0.1%
22869 10
 
< 0.1%
15490 10
 
< 0.1%
49373 9
 
< 0.1%
Other values (1352145) 1926297
50.0%
2025-01-02T18:29:45.118310image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
M 1928503
 
9.1%
U 1926491
 
9.1%
1926417
 
9.1%
N 1926384
 
9.1%
S 1926384
 
9.1%
1 1809860
 
8.5%
2 1247564
 
5.9%
3 1147863
 
5.4%
4 1110830
 
5.2%
5 1088353
 
5.1%
Other values (53) 5223626
24.6%

Most occurring categories

ValueCountFrequency (%)
(unknown) 21262275
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
M 1928503
 
9.1%
U 1926491
 
9.1%
1926417
 
9.1%
N 1926384
 
9.1%
S 1926384
 
9.1%
1 1809860
 
8.5%
2 1247564
 
5.9%
3 1147863
 
5.4%
4 1110830
 
5.2%
5 1088353
 
5.1%
Other values (53) 5223626
24.6%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 21262275
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
M 1928503
 
9.1%
U 1926491
 
9.1%
1926417
 
9.1%
N 1926384
 
9.1%
S 1926384
 
9.1%
1 1809860
 
8.5%
2 1247564
 
5.9%
3 1147863
 
5.4%
4 1110830
 
5.2%
5 1088353
 
5.1%
Other values (53) 5223626
24.6%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 21262275
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
M 1928503
 
9.1%
U 1926491
 
9.1%
1926417
 
9.1%
N 1926384
 
9.1%
S 1926384
 
9.1%
1 1809860
 
8.5%
2 1247564
 
5.9%
3 1147863
 
5.4%
4 1110830
 
5.2%
5 1088353
 
5.1%
Other values (53) 5223626
24.6%

recordNumber
Text

Missing 

Distinct119495
Distinct (%)98.1%
Missing1804636
Missing (%)93.7%
Memory size14.7 MiB
2025-01-02T18:29:45.328528image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length87
Median length14
Mean length13.17353166
Min length1

Characters and Unicode

Total characters1603917
Distinct characters81
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique118866 ?
Unique (%)97.6%

Sample

1st rowUSNPC # 001298
2nd rowFPlrv_430
3rd rowH-2284
4th rowUSNPC # 066527
5th rowUSNPC # 009815
ValueCountFrequency (%)
88145
28.7%
usnpc 88064
28.6%
ullz 5209
 
1.7%
rh 1566
 
0.5%
k-rh 1555
 
0.5%
ce16007-event 223
 
0.1%
2208 102
 
< 0.1%
1430 92
 
< 0.1%
1513 80
 
< 0.1%
beauty 75
 
< 0.1%
Other values (119414) 122317
39.8%
2025-01-02T18:29:45.606569image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
185675
 
11.6%
0 161175
 
10.0%
C 97557
 
6.1%
S 95231
 
5.9%
U 94869
 
5.9%
P 94146
 
5.9%
N 93453
 
5.8%
# 88221
 
5.5%
1 83004
 
5.2%
2 65151
 
4.1%
Other values (71) 545435
34.0%

Most occurring categories

ValueCountFrequency (%)
(unknown) 1603917
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
185675
 
11.6%
0 161175
 
10.0%
C 97557
 
6.1%
S 95231
 
5.9%
U 94869
 
5.9%
P 94146
 
5.9%
N 93453
 
5.8%
# 88221
 
5.5%
1 83004
 
5.2%
2 65151
 
4.1%
Other values (71) 545435
34.0%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 1603917
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
185675
 
11.6%
0 161175
 
10.0%
C 97557
 
6.1%
S 95231
 
5.9%
U 94869
 
5.9%
P 94146
 
5.9%
N 93453
 
5.8%
# 88221
 
5.5%
1 83004
 
5.2%
2 65151
 
4.1%
Other values (71) 545435
34.0%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 1603917
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
185675
 
11.6%
0 161175
 
10.0%
C 97557
 
6.1%
S 95231
 
5.9%
U 94869
 
5.9%
P 94146
 
5.9%
N 93453
 
5.8%
# 88221
 
5.5%
1 83004
 
5.2%
2 65151
 
4.1%
Other values (71) 545435
34.0%

recordedBy
Text

Missing 

Distinct37540
Distinct (%)3.2%
Missing764111
Missing (%)39.7%
Memory size14.7 MiB
2025-01-02T18:29:45.757804image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length24973
Median length156
Mean length23.05848945
Min length1

Characters and Unicode

Total characters26800375
Distinct characters99
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique16583 ?
Unique (%)1.4%

Sample

1st rowVIMS for BLM/ MMS
2nd rowLgl Ecological Research Associates/ Environmental Science And Engineering For BLM/ MMS
3rd rowUniversity of Southern California
4th rowUnited States Fish Commission
5th rowUnited States Fish Commission
ValueCountFrequency (%)
mms 181011
 
4.2%
blm 181009
 
4.2%
for 178053
 
4.2%
fish 168374
 
3.9%
united 164153
 
3.8%
states 163489
 
3.8%
commission 157086
 
3.7%
149579
 
3.5%
of 101785
 
2.4%
j 101464
 
2.4%
Other values (19944) 2737854
63.9%
2025-01-02T18:29:45.999934image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3119229
 
11.6%
e 2082531
 
7.8%
i 1879315
 
7.0%
n 1616253
 
6.0%
t 1592699
 
5.9%
o 1549731
 
5.8%
s 1530046
 
5.7%
a 1499469
 
5.6%
r 1221275
 
4.6%
M 808831
 
3.0%
Other values (89) 9900996
36.9%

Most occurring categories

ValueCountFrequency (%)
(unknown) 26800375
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
3119229
 
11.6%
e 2082531
 
7.8%
i 1879315
 
7.0%
n 1616253
 
6.0%
t 1592699
 
5.9%
o 1549731
 
5.8%
s 1530046
 
5.7%
a 1499469
 
5.6%
r 1221275
 
4.6%
M 808831
 
3.0%
Other values (89) 9900996
36.9%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 26800375
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
3119229
 
11.6%
e 2082531
 
7.8%
i 1879315
 
7.0%
n 1616253
 
6.0%
t 1592699
 
5.9%
o 1549731
 
5.8%
s 1530046
 
5.7%
a 1499469
 
5.6%
r 1221275
 
4.6%
M 808831
 
3.0%
Other values (89) 9900996
36.9%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 26800375
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
3119229
 
11.6%
e 2082531
 
7.8%
i 1879315
 
7.0%
n 1616253
 
6.0%
t 1592699
 
5.9%
o 1549731
 
5.8%
s 1530046
 
5.7%
a 1499469
 
5.6%
r 1221275
 
4.6%
M 808831
 
3.0%
Other values (89) 9900996
36.9%

recordedByID
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

individualCount
Real number (ℝ)

Skewed 

Distinct1067
Distinct (%)0.1%
Missing156
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean6.257458989
Minimum0
Maximum19634
Zeros7832
Zeros (%)0.4%
Negative0
Negative (%)0.0%
Memory size14.7 MiB
2025-01-02T18:29:46.075591image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q11
median1
Q34
95-th percentile16
Maximum19634
Range19634
Interquartile range (IQR)3

Descriptive statistics

Standard deviation57.22162707
Coefficient of variation (CV)9.14454688
Kurtosis18624.70278
Mean6.257458989
Median Absolute Deviation (MAD)0
Skewness100.3827769
Sum12053324
Variance3274.314605
MonotonicityNot monotonic
2025-01-02T18:29:46.148371image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 995782
51.7%
2 289568
 
15.0%
3 135771
 
7.0%
4 99105
 
5.1%
5 73928
 
3.8%
6 51745
 
2.7%
10 38952
 
2.0%
7 31375
 
1.6%
8 30170
 
1.6%
9 18501
 
1.0%
Other values (1057) 161336
 
8.4%
ValueCountFrequency (%)
0 7832
 
0.4%
1 995782
51.7%
2 289568
 
15.0%
3 135771
 
7.0%
4 99105
 
5.1%
ValueCountFrequency (%)
19634 1
< 0.1%
15284 1
< 0.1%
12500 1
< 0.1%
11404 1
< 0.1%
10000 2
< 0.1%

organismQuantity
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

organismQuantityType
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

sex
Text

Missing 

Distinct3
Distinct (%)< 0.1%
Missing1802976
Missing (%)93.6%
Memory size14.7 MiB
2025-01-02T18:29:46.210518image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length13
Median length6
Mean length5.129864763
Min length4

Characters and Unicode

Total characters633092
Distinct characters12
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowFEMALE
2nd rowFEMALE
3rd rowMALE
4th rowMALE
5th rowFEMALE
ValueCountFrequency (%)
female 68541
55.5%
male 54610
44.2%
hermaphrodite 262
 
0.2%
2025-01-02T18:29:46.317804image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
E 192216
30.4%
M 123413
19.5%
A 123413
19.5%
L 123151
19.5%
F 68541
 
10.8%
H 524
 
0.1%
R 524
 
0.1%
P 262
 
< 0.1%
O 262
 
< 0.1%
D 262
 
< 0.1%
Other values (2) 524
 
0.1%

Most occurring categories

ValueCountFrequency (%)
(unknown) 633092
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
E 192216
30.4%
M 123413
19.5%
A 123413
19.5%
L 123151
19.5%
F 68541
 
10.8%
H 524
 
0.1%
R 524
 
0.1%
P 262
 
< 0.1%
O 262
 
< 0.1%
D 262
 
< 0.1%
Other values (2) 524
 
0.1%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 633092
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
E 192216
30.4%
M 123413
19.5%
A 123413
19.5%
L 123151
19.5%
F 68541
 
10.8%
H 524
 
0.1%
R 524
 
0.1%
P 262
 
< 0.1%
O 262
 
< 0.1%
D 262
 
< 0.1%
Other values (2) 524
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 633092
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
E 192216
30.4%
M 123413
19.5%
A 123413
19.5%
L 123151
19.5%
F 68541
 
10.8%
H 524
 
0.1%
R 524
 
0.1%
P 262
 
< 0.1%
O 262
 
< 0.1%
D 262
 
< 0.1%
Other values (2) 524
 
0.1%

lifeStage
Text

Missing 

Distinct19
Distinct (%)0.1%
Missing1888852
Missing (%)98.1%
Memory size14.7 MiB
2025-01-02T18:29:46.380682image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length9
Median length8
Mean length6.544262994
Min length3

Characters and Unicode

Total characters245652
Distinct characters35
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st rowLarva
2nd rowJuvenile
3rd rowLarva
4th rowJuvenile
5th rowLarva
ValueCountFrequency (%)
juvenile 18119
48.3%
adult 9874
26.3%
larva 7695
20.5%
immature 711
 
1.9%
mature 247
 
0.7%
subadult 244
 
0.7%
egg 142
 
0.4%
megalopa 131
 
0.3%
veliger 126
 
0.3%
zoea 95
 
0.3%
Other values (9) 153
 
0.4%
2025-01-02T18:29:46.501793image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
e 37685
15.3%
u 29584
12.0%
l 28565
11.6%
v 25814
10.5%
i 18319
7.5%
n 18135
7.4%
J 18119
7.4%
a 17028
6.9%
t 11097
 
4.5%
d 10135
 
4.1%
Other values (25) 31171
12.7%

Most occurring categories

ValueCountFrequency (%)
(unknown) 245652
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
e 37685
15.3%
u 29584
12.0%
l 28565
11.6%
v 25814
10.5%
i 18319
7.5%
n 18135
7.4%
J 18119
7.4%
a 17028
6.9%
t 11097
 
4.5%
d 10135
 
4.1%
Other values (25) 31171
12.7%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 245652
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
e 37685
15.3%
u 29584
12.0%
l 28565
11.6%
v 25814
10.5%
i 18319
7.5%
n 18135
7.4%
J 18119
7.4%
a 17028
6.9%
t 11097
 
4.5%
d 10135
 
4.1%
Other values (25) 31171
12.7%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 245652
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
e 37685
15.3%
u 29584
12.0%
l 28565
11.6%
v 25814
10.5%
i 18319
7.5%
n 18135
7.4%
J 18119
7.4%
a 17028
6.9%
t 11097
 
4.5%
d 10135
 
4.1%
Other values (25) 31171
12.7%

reproductiveCondition
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

caste
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

behavior
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

vitality
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

establishmentMeans
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

degreeOfEstablishment
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

pathway
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

georeferenceVerificationStatus
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB
Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size14.7 MiB
2025-01-02T18:29:46.558645image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length10
Median length7
Mean length6.99788049
Min length6

Characters and Unicode

Total characters13480640
Distinct characters15
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2 ?
Unique (%)< 0.1%

Sample

1st rowPRESENT
2nd rowPRESENT
3rd rowPRESENT
4th rowPRESENT
5th rowPRESENT
ValueCountFrequency (%)
present 1922298
99.8%
absent 4089
 
0.2%
1993-09-09 1
 
< 0.1%
1938-09-22 1
 
< 0.1%
2025-01-02T18:29:46.677092image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
E 3848685
28.5%
N 1926387
14.3%
S 1926387
14.3%
T 1926387
14.3%
P 1922298
14.3%
R 1922298
14.3%
A 4089
 
< 0.1%
B 4089
 
< 0.1%
9 6
 
< 0.1%
- 4
 
< 0.1%
Other values (5) 10
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
(unknown) 13480640
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
E 3848685
28.5%
N 1926387
14.3%
S 1926387
14.3%
T 1926387
14.3%
P 1922298
14.3%
R 1922298
14.3%
A 4089
 
< 0.1%
B 4089
 
< 0.1%
9 6
 
< 0.1%
- 4
 
< 0.1%
Other values (5) 10
 
< 0.1%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 13480640
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
E 3848685
28.5%
N 1926387
14.3%
S 1926387
14.3%
T 1926387
14.3%
P 1922298
14.3%
R 1922298
14.3%
A 4089
 
< 0.1%
B 4089
 
< 0.1%
9 6
 
< 0.1%
- 4
 
< 0.1%
Other values (5) 10
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 13480640
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
E 3848685
28.5%
N 1926387
14.3%
S 1926387
14.3%
T 1926387
14.3%
P 1922298
14.3%
R 1922298
14.3%
A 4089
 
< 0.1%
B 4089
 
< 0.1%
9 6
 
< 0.1%
- 4
 
< 0.1%
Other values (5) 10
 
< 0.1%
Distinct527
Distinct (%)< 0.1%
Missing1860
Missing (%)0.1%
Memory size14.7 MiB
2025-01-02T18:29:46.745571image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length167
Median length157
Mean length10.12228031
Min length3

Characters and Unicode

Total characters19480622
Distinct characters53
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique212 ?
Unique (%)< 0.1%

Sample

1st rowAlcohol (Ethanol)
2nd rowDry
3rd rowAlcohol (Ethanol)
4th rowDry
5th rowDry
ValueCountFrequency (%)
ethanol 907116
30.8%
dry 902340
30.6%
alcohol 897623
30.5%
slide 129646
 
4.4%
19548
 
0.7%
95 16839
 
0.6%
formalin 12585
 
0.4%
biorepository 12373
 
0.4%
isopropyl 10055
 
0.3%
sorting 6036
 
0.2%
Other values (40) 31872
 
1.1%
2025-01-02T18:29:46.899713image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
l 2866425
14.7%
o 2797181
14.4%
h 1806304
 
9.3%
1021504
 
5.2%
r 954327
 
4.9%
t 939558
 
4.8%
n 936852
 
4.8%
a 925741
 
4.8%
y 923985
 
4.7%
E 913016
 
4.7%
Other values (43) 5395729
27.7%

Most occurring categories

ValueCountFrequency (%)
(unknown) 19480622
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
l 2866425
14.7%
o 2797181
14.4%
h 1806304
 
9.3%
1021504
 
5.2%
r 954327
 
4.9%
t 939558
 
4.8%
n 936852
 
4.8%
a 925741
 
4.8%
y 923985
 
4.7%
E 913016
 
4.7%
Other values (43) 5395729
27.7%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 19480622
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
l 2866425
14.7%
o 2797181
14.4%
h 1806304
 
9.3%
1021504
 
5.2%
r 954327
 
4.9%
t 939558
 
4.8%
n 936852
 
4.8%
a 925741
 
4.8%
y 923985
 
4.7%
E 913016
 
4.7%
Other values (43) 5395729
27.7%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 19480622
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
l 2866425
14.7%
o 2797181
14.4%
h 1806304
 
9.3%
1021504
 
5.2%
r 954327
 
4.9%
t 939558
 
4.8%
n 936852
 
4.8%
a 925741
 
4.8%
y 923985
 
4.7%
E 913016
 
4.7%
Other values (43) 5395729
27.7%

disposition
Real number (ℝ)

Missing 

Distinct2
Distinct (%)100.0%
Missing1926387
Missing (%)> 99.9%
Infinite0
Infinite (%)0.0%
Mean258.5
Minimum252
Maximum265
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size14.7 MiB
2025-01-02T18:29:46.954628image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Quantile statistics

Minimum252
5-th percentile252.65
Q1255.25
median258.5
Q3261.75
95-th percentile264.35
Maximum265
Range13
Interquartile range (IQR)6.5

Descriptive statistics

Standard deviation9.192388155
Coefficient of variation (CV)0.03556049577
Kurtosisnan
Mean258.5
Median Absolute Deviation (MAD)6.5
Skewnessnan
Sum517
Variance84.5
MonotonicityStrictly increasing
2025-01-02T18:29:47.007070image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
Histogram with fixed size bins (bins=2)
ValueCountFrequency (%)
252 1
 
< 0.1%
265 1
 
< 0.1%
(Missing) 1926387
> 99.9%
ValueCountFrequency (%)
252 1
< 0.1%
265 1
< 0.1%
ValueCountFrequency (%)
265 1
< 0.1%
252 1
< 0.1%

associatedOccurrences
Real number (ℝ)

Missing 

Distinct2
Distinct (%)100.0%
Missing1926387
Missing (%)> 99.9%
Infinite0
Infinite (%)0.0%
Mean258.5
Minimum252
Maximum265
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size14.7 MiB
2025-01-02T18:29:47.058374image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Quantile statistics

Minimum252
5-th percentile252.65
Q1255.25
median258.5
Q3261.75
95-th percentile264.35
Maximum265
Range13
Interquartile range (IQR)6.5

Descriptive statistics

Standard deviation9.192388155
Coefficient of variation (CV)0.03556049577
Kurtosisnan
Mean258.5
Median Absolute Deviation (MAD)6.5
Skewnessnan
Sum517
Variance84.5
MonotonicityStrictly increasing
2025-01-02T18:29:47.107182image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
Histogram with fixed size bins (bins=2)
ValueCountFrequency (%)
252 1
 
< 0.1%
265 1
 
< 0.1%
(Missing) 1926387
> 99.9%
ValueCountFrequency (%)
252 1
< 0.1%
265 1
< 0.1%
ValueCountFrequency (%)
265 1
< 0.1%
252 1
< 0.1%

associatedReferences
Real number (ℝ)

Missing 

Distinct2
Distinct (%)100.0%
Missing1926387
Missing (%)> 99.9%
Infinite0
Infinite (%)0.0%
Mean1965.5
Minimum1938
Maximum1993
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size14.7 MiB
2025-01-02T18:29:47.150967image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Quantile statistics

Minimum1938
5-th percentile1940.75
Q11951.75
median1965.5
Q31979.25
95-th percentile1990.25
Maximum1993
Range55
Interquartile range (IQR)27.5

Descriptive statistics

Standard deviation38.89087297
Coefficient of variation (CV)0.01978675806
Kurtosisnan
Mean1965.5
Median Absolute Deviation (MAD)27.5
Skewnessnan
Sum3931
Variance1512.5
MonotonicityStrictly decreasing
2025-01-02T18:29:47.197517image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
Histogram with fixed size bins (bins=2)
ValueCountFrequency (%)
1993 1
 
< 0.1%
1938 1
 
< 0.1%
(Missing) 1926387
> 99.9%
ValueCountFrequency (%)
1938 1
< 0.1%
1993 1
< 0.1%
ValueCountFrequency (%)
1993 1
< 0.1%
1938 1
< 0.1%

associatedSequences
Text

Missing 

Distinct5098
Distinct (%)99.5%
Missing1921265
Missing (%)99.7%
Memory size14.7 MiB
2025-01-02T18:29:47.289554image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length1349
Median length49
Mean length85.4980484
Min length1

Characters and Unicode

Total characters438092
Distinct characters61
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5082 ?
Unique (%)99.2%

Sample

1st rowhttps://www.ncbi.nlm.nih.gov/gquery?term=AY426351;https://www.ncbi.nlm.nih.gov/gquery?term=AY379442;https://www.ncbi.nlm.nih.gov/gquery?term=AY426385
2nd rowhttps://www.ncbi.nlm.nih.gov/gquery?term=MH825989
3rd rowhttps://www.ncbi.nlm.nih.gov/gquery?term=MT223244
4th rowhttps://www.ncbi.nlm.nih.gov/gquery?term=MH826372
5th rowhttps://www.ncbi.nlm.nih.gov/gquery?term=KT792656
ValueCountFrequency (%)
https://www.ncbi.nlm.nih.gov/gquery?term=km521547 12
 
0.2%
https://www.ncbi.nlm.nih.gov/gquery?term=ku285912 2
 
< 0.1%
https://www.ncbi.nlm.nih.gov/gquery?term=kx832080 2
 
< 0.1%
https://www.ncbi.nlm.nih.gov/gquery?term=mh244118 2
 
< 0.1%
https://www.ncbi.nlm.nih.gov/gquery?term=srr9613700 2
 
< 0.1%
https://www.ncbi.nlm.nih.gov/gquery?term=kp739770 2
 
< 0.1%
https://www.ncbi.nlm.nih.gov/gquery?term=jq307001 2
 
< 0.1%
9 2
 
< 0.1%
https://www.ncbi.nlm.nih.gov/gquery?term=mk246580;https://www.ncbi.nlm.nih.gov/gquery?term=mk246484 2
 
< 0.1%
https://www.ncbi.nlm.nih.gov/gquery?term=mk246581;https://www.ncbi.nlm.nih.gov/gquery?term=mk246487 2
 
< 0.1%
Other values (5088) 5094
99.4%
2025-01-02T18:29:47.538842image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
. 35419
 
8.1%
n 26562
 
6.1%
w 26562
 
6.1%
/ 26562
 
6.1%
t 26562
 
6.1%
i 17708
 
4.0%
h 17708
 
4.0%
g 17708
 
4.0%
m 17708
 
4.0%
r 17708
 
4.0%
Other values (51) 207885
47.5%

Most occurring categories

ValueCountFrequency (%)
(unknown) 438092
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
. 35419
 
8.1%
n 26562
 
6.1%
w 26562
 
6.1%
/ 26562
 
6.1%
t 26562
 
6.1%
i 17708
 
4.0%
h 17708
 
4.0%
g 17708
 
4.0%
m 17708
 
4.0%
r 17708
 
4.0%
Other values (51) 207885
47.5%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 438092
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
. 35419
 
8.1%
n 26562
 
6.1%
w 26562
 
6.1%
/ 26562
 
6.1%
t 26562
 
6.1%
i 17708
 
4.0%
h 17708
 
4.0%
g 17708
 
4.0%
m 17708
 
4.0%
r 17708
 
4.0%
Other values (51) 207885
47.5%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 438092
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
. 35419
 
8.1%
n 26562
 
6.1%
w 26562
 
6.1%
/ 26562
 
6.1%
t 26562
 
6.1%
i 17708
 
4.0%
h 17708
 
4.0%
g 17708
 
4.0%
m 17708
 
4.0%
r 17708
 
4.0%
Other values (51) 207885
47.5%

associatedTaxa
Real number (ℝ)

Missing 

Distinct2
Distinct (%)100.0%
Missing1926387
Missing (%)> 99.9%
Infinite0
Infinite (%)0.0%
Mean15.5
Minimum9
Maximum22
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size14.7 MiB
2025-01-02T18:29:47.602111image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Quantile statistics

Minimum9
5-th percentile9.65
Q112.25
median15.5
Q318.75
95-th percentile21.35
Maximum22
Range13
Interquartile range (IQR)6.5

Descriptive statistics

Standard deviation9.192388155
Coefficient of variation (CV)0.5930573004
Kurtosisnan
Mean15.5
Median Absolute Deviation (MAD)6.5
Skewnessnan
Sum31
Variance84.5
MonotonicityStrictly increasing
2025-01-02T18:29:47.650864image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
Histogram with fixed size bins (bins=2)
ValueCountFrequency (%)
9 1
 
< 0.1%
22 1
 
< 0.1%
(Missing) 1926387
> 99.9%
ValueCountFrequency (%)
9 1
< 0.1%
22 1
< 0.1%
ValueCountFrequency (%)
22 1
< 0.1%
9 1
< 0.1%

otherCatalogNumbers
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

occurrenceRemarks
Text

Missing 

Distinct384890
Distinct (%)49.2%
Missing1144481
Missing (%)59.4%
Memory size14.7 MiB
2025-01-02T18:29:47.938020image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length48981
Median length1371
Mean length61.50956506
Min length1

Characters and Unicode

Total characters48094821
Distinct characters133
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique322676 ?
Unique (%)41.3%

Sample

1st rowJewett.; Stearns.
2nd rowBartsch
3rd row15 Nov. 1973; Jones, Dawson, del Rosario; Fitzgerald; NMNH-STRI Survey
4th rowU. S. B. Fish
5th rowC.R. Laws
ValueCountFrequency (%)
coll 143199
 
2.1%
of 115369
 
1.7%
and 111363
 
1.7%
a 107288
 
1.6%
by 89612
 
1.3%
87811
 
1.3%
2 65618
 
1.0%
3 63129
 
0.9%
was 62154
 
0.9%
formalin 58892
 
0.9%
Other values (238106) 5777747
86.5%
2025-01-02T18:29:48.414921image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5892887
 
12.3%
e 2965997
 
6.2%
o 2602001
 
5.4%
a 2414749
 
5.0%
i 2010061
 
4.2%
t 1978195
 
4.1%
n 1975689
 
4.1%
r 1877425
 
3.9%
s 1858443
 
3.9%
l 1812957
 
3.8%
Other values (123) 22706417
47.2%

Most occurring categories

ValueCountFrequency (%)
(unknown) 48094821
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
5892887
 
12.3%
e 2965997
 
6.2%
o 2602001
 
5.4%
a 2414749
 
5.0%
i 2010061
 
4.2%
t 1978195
 
4.1%
n 1975689
 
4.1%
r 1877425
 
3.9%
s 1858443
 
3.9%
l 1812957
 
3.8%
Other values (123) 22706417
47.2%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 48094821
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
5892887
 
12.3%
e 2965997
 
6.2%
o 2602001
 
5.4%
a 2414749
 
5.0%
i 2010061
 
4.2%
t 1978195
 
4.1%
n 1975689
 
4.1%
r 1877425
 
3.9%
s 1858443
 
3.9%
l 1812957
 
3.8%
Other values (123) 22706417
47.2%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 48094821
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
5892887
 
12.3%
e 2965997
 
6.2%
o 2602001
 
5.4%
a 2414749
 
5.0%
i 2010061
 
4.2%
t 1978195
 
4.1%
n 1975689
 
4.1%
r 1877425
 
3.9%
s 1858443
 
3.9%
l 1812957
 
3.8%
Other values (123) 22706417
47.2%

organismID
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

organismName
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

organismScope
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

associatedOrganisms
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

previousIdentifications
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

organismRemarks
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

materialEntityID
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

materialEntityRemarks
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

verbatimLabel
Text

Missing 

Distinct2
Distinct (%)100.0%
Missing1926387
Missing (%)> 99.9%
Memory size14.7 MiB
2025-01-02T18:29:48.605717image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length62
Median length48.5
Mean length48.5
Min length35

Characters and Unicode

Total characters97
Distinct characters28
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2 ?
Unique (%)100.0%

Sample

1st rowNorth America, North Pacific Ocean, Gulf Of California, Mexico
2nd rowNorth America, United States, Texas
ValueCountFrequency (%)
north 3
21.4%
america 2
14.3%
pacific 1
 
7.1%
ocean 1
 
7.1%
gulf 1
 
7.1%
of 1
 
7.1%
california 1
 
7.1%
mexico 1
 
7.1%
united 1
 
7.1%
states 1
 
7.1%
2025-01-02T18:29:48.727262image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
12
 
12.4%
i 8
 
8.2%
a 8
 
8.2%
e 7
 
7.2%
c 6
 
6.2%
t 6
 
6.2%
r 6
 
6.2%
o 5
 
5.2%
, 5
 
5.2%
f 4
 
4.1%
Other values (18) 30
30.9%

Most occurring categories

ValueCountFrequency (%)
(unknown) 97
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
12
 
12.4%
i 8
 
8.2%
a 8
 
8.2%
e 7
 
7.2%
c 6
 
6.2%
t 6
 
6.2%
r 6
 
6.2%
o 5
 
5.2%
, 5
 
5.2%
f 4
 
4.1%
Other values (18) 30
30.9%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 97
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
12
 
12.4%
i 8
 
8.2%
a 8
 
8.2%
e 7
 
7.2%
c 6
 
6.2%
t 6
 
6.2%
r 6
 
6.2%
o 5
 
5.2%
, 5
 
5.2%
f 4
 
4.1%
Other values (18) 30
30.9%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 97
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
12
 
12.4%
i 8
 
8.2%
a 8
 
8.2%
e 7
 
7.2%
c 6
 
6.2%
t 6
 
6.2%
r 6
 
6.2%
o 5
 
5.2%
, 5
 
5.2%
f 4
 
4.1%
Other values (18) 30
30.9%

materialSampleID
Text

Constant  Missing 

Distinct1
Distinct (%)50.0%
Missing1926387
Missing (%)> 99.9%
Memory size14.7 MiB
2025-01-02T18:29:48.817614image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length13
Median length13
Mean length13
Min length13

Characters and Unicode

Total characters26
Distinct characters11
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNORTH_AMERICA
2nd rowNORTH_AMERICA
ValueCountFrequency (%)
north_america 2
100.0%
2025-01-02T18:29:49.003571image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
R 4
15.4%
A 4
15.4%
N 2
7.7%
O 2
7.7%
T 2
7.7%
H 2
7.7%
_ 2
7.7%
M 2
7.7%
E 2
7.7%
I 2
7.7%

Most occurring categories

ValueCountFrequency (%)
(unknown) 26
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
R 4
15.4%
A 4
15.4%
N 2
7.7%
O 2
7.7%
T 2
7.7%
H 2
7.7%
_ 2
7.7%
M 2
7.7%
E 2
7.7%
I 2
7.7%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 26
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
R 4
15.4%
A 4
15.4%
N 2
7.7%
O 2
7.7%
T 2
7.7%
H 2
7.7%
_ 2
7.7%
M 2
7.7%
E 2
7.7%
I 2
7.7%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 26
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
R 4
15.4%
A 4
15.4%
N 2
7.7%
O 2
7.7%
T 2
7.7%
H 2
7.7%
_ 2
7.7%
M 2
7.7%
E 2
7.7%
I 2
7.7%

eventID
Text

Constant  Missing 

Distinct1
Distinct (%)100.0%
Missing1926388
Missing (%)> 99.9%
Memory size14.7 MiB
2025-01-02T18:29:49.075877image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length39
Median length39
Mean length39
Min length39

Characters and Unicode

Total characters39
Distinct characters19
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1 ?
Unique (%)100.0%

Sample

1st rowNorth Pacific Ocean, Gulf Of California
ValueCountFrequency (%)
north 1
16.7%
pacific 1
16.7%
ocean 1
16.7%
gulf 1
16.7%
of 1
16.7%
california 1
16.7%
2025-01-02T18:29:49.194709image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5
12.8%
f 4
10.3%
i 4
10.3%
a 4
10.3%
c 3
 
7.7%
o 2
 
5.1%
r 2
 
5.1%
n 2
 
5.1%
l 2
 
5.1%
O 2
 
5.1%
Other values (9) 9
23.1%

Most occurring categories

ValueCountFrequency (%)
(unknown) 39
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
5
12.8%
f 4
10.3%
i 4
10.3%
a 4
10.3%
c 3
 
7.7%
o 2
 
5.1%
r 2
 
5.1%
n 2
 
5.1%
l 2
 
5.1%
O 2
 
5.1%
Other values (9) 9
23.1%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 39
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
5
12.8%
f 4
10.3%
i 4
10.3%
a 4
10.3%
c 3
 
7.7%
o 2
 
5.1%
r 2
 
5.1%
n 2
 
5.1%
l 2
 
5.1%
O 2
 
5.1%
Other values (9) 9
23.1%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 39
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
5
12.8%
f 4
10.3%
i 4
10.3%
a 4
10.3%
c 3
 
7.7%
o 2
 
5.1%
r 2
 
5.1%
n 2
 
5.1%
l 2
 
5.1%
O 2
 
5.1%
Other values (9) 9
23.1%

parentEventID
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

eventType
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

fieldNumber
Text

Missing 

Distinct62650
Distinct (%)10.7%
Missing1339757
Missing (%)69.5%
Memory size14.7 MiB
2025-01-02T18:29:49.352182image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length111
Median length63
Mean length13.61565854
Min length1

Characters and Unicode

Total characters7987381
Distinct characters82
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique27488 ?
Unique (%)4.7%

Sample

1st rowMMS-CABP/02B-E4
2nd row4/III-23-TDS
3rd rowUSARP/EL/12/1002/USC
4th rowUSFC/A2059
5th rowUSFC/A5374
ValueCountFrequency (%)
mms-mafla/jar 17292
 
2.6%
bolland/rfb 7605
 
1.1%
humes 5243
 
0.8%
jpem 5028
 
0.8%
4975
 
0.8%
rh 2306
 
0.3%
k-rh 1557
 
0.2%
spm 1164
 
0.2%
mnhn-norfolk 1131
 
0.2%
haul 1040
 
0.2%
Other values (59084) 614436
92.8%
2025-01-02T18:29:49.595051image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
/ 742746
 
9.3%
S 650689
 
8.1%
M 501373
 
6.3%
- 480056
 
6.0%
A 421866
 
5.3%
1 403237
 
5.0%
0 377830
 
4.7%
C 368160
 
4.6%
2 360966
 
4.5%
U 266532
 
3.3%
Other values (72) 3413926
42.7%

Most occurring categories

ValueCountFrequency (%)
(unknown) 7987381
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
/ 742746
 
9.3%
S 650689
 
8.1%
M 501373
 
6.3%
- 480056
 
6.0%
A 421866
 
5.3%
1 403237
 
5.0%
0 377830
 
4.7%
C 368160
 
4.6%
2 360966
 
4.5%
U 266532
 
3.3%
Other values (72) 3413926
42.7%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 7987381
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
/ 742746
 
9.3%
S 650689
 
8.1%
M 501373
 
6.3%
- 480056
 
6.0%
A 421866
 
5.3%
1 403237
 
5.0%
0 377830
 
4.7%
C 368160
 
4.6%
2 360966
 
4.5%
U 266532
 
3.3%
Other values (72) 3413926
42.7%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 7987381
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
/ 742746
 
9.3%
S 650689
 
8.1%
M 501373
 
6.3%
- 480056
 
6.0%
A 421866
 
5.3%
1 403237
 
5.0%
0 377830
 
4.7%
C 368160
 
4.6%
2 360966
 
4.5%
U 266532
 
3.3%
Other values (72) 3413926
42.7%

eventDate
Text

Missing 

Distinct45561
Distinct (%)3.7%
Missing688611
Missing (%)35.7%
Memory size14.7 MiB
2025-01-02T18:29:49.748224image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length21
Median length10
Mean length9.825818523
Min length4

Characters and Unicode

Total characters12162182
Distinct characters17
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6824 ?
Unique (%)0.6%

Sample

1st row1976-03-03
2nd row1984-05-15
3rd row1964-03-15
4th row1883-08-31
5th row1909-03-02
ValueCountFrequency (%)
1915 6254
 
0.5%
1982-07-21 5684
 
0.5%
1981-07-06 5412
 
0.4%
1983-05-13 5155
 
0.4%
1982-11-19 5039
 
0.4%
1982-02-10 4461
 
0.4%
1981-11-09 4297
 
0.3%
1913 4293
 
0.3%
1982-05-10 4269
 
0.3%
1977-01-28/1977-02-13 3795
 
0.3%
Other values (45551) 1189119
96.1%
2025-01-02T18:29:49.959854image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 2343416
19.3%
- 2329123
19.2%
0 1804491
14.8%
9 1499547
12.3%
2 828830
 
6.8%
8 778905
 
6.4%
7 716496
 
5.9%
6 564566
 
4.6%
5 436404
 
3.6%
3 431149
 
3.5%
Other values (7) 429255
 
3.5%

Most occurring categories

ValueCountFrequency (%)
(unknown) 12162182
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
1 2343416
19.3%
- 2329123
19.2%
0 1804491
14.8%
9 1499547
12.3%
2 828830
 
6.8%
8 778905
 
6.4%
7 716496
 
5.9%
6 564566
 
4.6%
5 436404
 
3.6%
3 431149
 
3.5%
Other values (7) 429255
 
3.5%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 12162182
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
1 2343416
19.3%
- 2329123
19.2%
0 1804491
14.8%
9 1499547
12.3%
2 828830
 
6.8%
8 778905
 
6.4%
7 716496
 
5.9%
6 564566
 
4.6%
5 436404
 
3.6%
3 431149
 
3.5%
Other values (7) 429255
 
3.5%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 12162182
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
1 2343416
19.3%
- 2329123
19.2%
0 1804491
14.8%
9 1499547
12.3%
2 828830
 
6.8%
8 778905
 
6.4%
7 716496
 
5.9%
6 564566
 
4.6%
5 436404
 
3.6%
3 431149
 
3.5%
Other values (7) 429255
 
3.5%

eventTime
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

startDayOfYear
Real number (ℝ)

Missing 

Distinct366
Distinct (%)< 0.1%
Missing842312
Missing (%)43.7%
Infinite0
Infinite (%)0.0%
Mean176.4092348
Minimum1
Maximum366
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size14.7 MiB
2025-01-02T18:29:50.031307image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile28
Q1100
median175
Q3248
95-th percentile332
Maximum366
Range365
Interquartile range (IQR)148

Descriptive statistics

Standard deviation95.36922369
Coefficient of variation (CV)0.5406135558
Kurtosis-1.020939968
Mean176.4092348
Median Absolute Deviation (MAD)74
Skewness0.05928445511
Sum191241194
Variance9095.288827
MonotonicityNot monotonic
2025-01-02T18:29:50.099838image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
202 9215
 
0.5%
133 9048
 
0.5%
187 8343
 
0.4%
130 7952
 
0.4%
323 7925
 
0.4%
41 7863
 
0.4%
145 7055
 
0.4%
313 6543
 
0.3%
175 6524
 
0.3%
263 6356
 
0.3%
Other values (356) 1007253
52.3%
(Missing) 842312
43.7%
ValueCountFrequency (%)
1 1012
0.1%
2 2206
0.1%
3 1327
0.1%
4 1069
0.1%
5 1588
0.1%
ValueCountFrequency (%)
366 194
 
< 0.1%
365 1197
0.1%
364 864
< 0.1%
363 768
< 0.1%
362 923
< 0.1%

endDayOfYear
Unsupported

Missing  Rejected  Unsupported 

Missing842310
Missing (%)43.7%
Memory size14.7 MiB

year
Real number (ℝ)

Missing 

Distinct207
Distinct (%)< 0.1%
Missing689273
Missing (%)35.8%
Infinite0
Infinite (%)0.0%
Mean1958.619874
Minimum1806
Maximum2024
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size14.7 MiB
2025-01-02T18:29:50.168102image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Quantile statistics

Minimum1806
5-th percentile1888
Q11938
median1970
Q31981
95-th percentile2002
Maximum2024
Range218
Interquartile range (IQR)43

Descriptive statistics

Standard deviation33.77200271
Coefficient of variation (CV)0.01724275505
Kurtosis-0.1590680529
Mean1958.619874
Median Absolute Deviation (MAD)13
Skewness-0.8552100928
Sum2423039984
Variance1140.548167
MonotonicityNot monotonic
2025-01-02T18:29:50.234136image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1977 73835
 
3.8%
1981 43749
 
2.3%
1976 42199
 
2.2%
1984 38196
 
2.0%
1982 38145
 
2.0%
1908 35299
 
1.8%
1983 34031
 
1.8%
1985 30482
 
1.6%
1964 28236
 
1.5%
1975 25013
 
1.3%
Other values (197) 847931
44.0%
(Missing) 689273
35.8%
ValueCountFrequency (%)
1806 1
 
< 0.1%
1809 1
 
< 0.1%
1816 3
< 0.1%
1817 1
 
< 0.1%
1818 1
 
< 0.1%
ValueCountFrequency (%)
2024 28
 
< 0.1%
2023 1677
0.1%
2022 980
0.1%
2021 667
 
< 0.1%
2020 96
 
< 0.1%

month
Real number (ℝ)

Missing 

Distinct12
Distinct (%)< 0.1%
Missing800939
Missing (%)41.6%
Infinite0
Infinite (%)0.0%
Mean6.351989871
Minimum1
Maximum12
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size14.7 MiB
2025-01-02T18:29:50.289678image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q14
median6
Q39
95-th percentile11
Maximum12
Range11
Interquartile range (IQR)5

Descriptive statistics

Standard deviation3.130330126
Coefficient of variation (CV)0.4928109442
Kurtosis-1.000608151
Mean6.351989871
Median Absolute Deviation (MAD)2
Skewness0.05549551892
Sum7148847
Variance9.798966696
MonotonicityNot monotonic
2025-01-02T18:29:50.339781image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
Histogram with fixed size bins (bins=12)
ValueCountFrequency (%)
8 129893
 
6.7%
5 124557
 
6.5%
7 123176
 
6.4%
6 104254
 
5.4%
4 99638
 
5.2%
11 96677
 
5.0%
2 95459
 
5.0%
3 89439
 
4.6%
9 80447
 
4.2%
10 66176
 
3.4%
Other values (2) 115734
 
6.0%
(Missing) 800939
41.6%
ValueCountFrequency (%)
1 63333
3.3%
2 95459
5.0%
3 89439
4.6%
4 99638
5.2%
5 124557
6.5%
ValueCountFrequency (%)
12 52401
2.7%
11 96677
5.0%
10 66176
3.4%
9 80447
4.2%
8 129893
6.7%

day
Real number (ℝ)

Missing 

Distinct31
Distinct (%)< 0.1%
Missing887052
Missing (%)46.0%
Infinite0
Infinite (%)0.0%
Mean15.32566434
Minimum1
Maximum31
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size14.7 MiB
2025-01-02T18:29:50.391817image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2
Q18
median15
Q322
95-th percentile29
Maximum31
Range30
Interquartile range (IQR)14

Descriptive statistics

Standard deviation8.541758753
Coefficient of variation (CV)0.5573499825
Kurtosis-1.115071335
Mean15.32566434
Median Absolute Deviation (MAD)7
Skewness0.07233244424
Sum15928530
Variance72.96164259
MonotonicityNot monotonic
2025-01-02T18:29:50.448514image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
Histogram with fixed size bins (bins=31)
ValueCountFrequency (%)
13 42864
 
2.2%
10 42434
 
2.2%
19 40651
 
2.1%
6 39463
 
2.0%
21 37986
 
2.0%
9 37781
 
2.0%
15 37214
 
1.9%
18 36290
 
1.9%
14 35493
 
1.8%
16 35079
 
1.8%
Other values (21) 654082
34.0%
(Missing) 887052
46.0%
ValueCountFrequency (%)
1 31620
1.6%
2 33326
1.7%
3 31847
1.7%
4 34956
1.8%
5 35035
1.8%
ValueCountFrequency (%)
31 17881
0.9%
30 27743
1.4%
29 26961
1.4%
28 29731
1.5%
27 28668
1.5%

verbatimEventDate
Text

Missing 

Distinct47775
Distinct (%)6.3%
Missing1173196
Missing (%)60.9%
Memory size14.7 MiB
2025-01-02T18:29:50.608446image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length181
Median length11
Mean length11.01797414
Min length1

Characters and Unicode

Total characters8298661
Distinct characters81
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique15836 ?
Unique (%)2.1%

Sample

1st row-- --- ----
2nd row15 MAY 1984
3rd row15 MAR 1964
4th row03 MAR 1967
5th row31 AUG 1958
ValueCountFrequency (%)
275912
 
12.6%
may 68627
 
3.1%
aug 65853
 
3.0%
jul 61532
 
2.8%
apr 57935
 
2.6%
feb 53288
 
2.4%
jun 52783
 
2.4%
nov 52211
 
2.4%
mar 46122
 
2.1%
1977 42132
 
1.9%
Other values (8403) 1419005
64.6%
2025-01-02T18:29:50.859793image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1442207
17.4%
1 1077549
13.0%
9 807907
 
9.7%
- 749611
 
9.0%
2 340281
 
4.1%
7 334273
 
4.0%
0 322856
 
3.9%
8 301957
 
3.6%
6 296090
 
3.6%
A 274118
 
3.3%
Other values (71) 2351812
28.3%

Most occurring categories

ValueCountFrequency (%)
(unknown) 8298661
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
1442207
17.4%
1 1077549
13.0%
9 807907
 
9.7%
- 749611
 
9.0%
2 340281
 
4.1%
7 334273
 
4.0%
0 322856
 
3.9%
8 301957
 
3.6%
6 296090
 
3.6%
A 274118
 
3.3%
Other values (71) 2351812
28.3%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 8298661
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
1442207
17.4%
1 1077549
13.0%
9 807907
 
9.7%
- 749611
 
9.0%
2 340281
 
4.1%
7 334273
 
4.0%
0 322856
 
3.9%
8 301957
 
3.6%
6 296090
 
3.6%
A 274118
 
3.3%
Other values (71) 2351812
28.3%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 8298661
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
1442207
17.4%
1 1077549
13.0%
9 807907
 
9.7%
- 749611
 
9.0%
2 340281
 
4.1%
7 334273
 
4.0%
0 322856
 
3.9%
8 301957
 
3.6%
6 296090
 
3.6%
A 274118
 
3.3%
Other values (71) 2351812
28.3%

habitat
Text

Missing 

Distinct18961
Distinct (%)27.4%
Missing1857134
Missing (%)96.4%
Memory size14.7 MiB
2025-01-02T18:29:51.024571image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length235
Median length159
Mean length19.79855606
Min length1

Characters and Unicode

Total characters1371149
Distinct characters89
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique13600 ?
Unique (%)19.6%

Sample

1st rowBeach with fresh water creek running into it
2nd rowFreshwater
3rd rowIn sand
4th rowMangrove
5th rowUnder rocks
ValueCountFrequency (%)
freshwater 9208
 
4.1%
in 6886
 
3.1%
on 6374
 
2.8%
reef 6192
 
2.8%
sand 6092
 
2.7%
coral 5812
 
2.6%
of 4886
 
2.2%
rocks 4639
 
2.1%
sp 4290
 
1.9%
intertidal 4238
 
1.9%
Other values (6965) 165796
73.9%
2025-01-02T18:29:51.270131image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
155158
 
11.3%
e 134096
 
9.8%
a 117965
 
8.6%
r 101199
 
7.4%
n 83052
 
6.1%
s 82888
 
6.0%
o 79802
 
5.8%
t 71848
 
5.2%
i 60753
 
4.4%
l 60223
 
4.4%
Other values (79) 424165
30.9%

Most occurring categories

ValueCountFrequency (%)
(unknown) 1371149
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
155158
 
11.3%
e 134096
 
9.8%
a 117965
 
8.6%
r 101199
 
7.4%
n 83052
 
6.1%
s 82888
 
6.0%
o 79802
 
5.8%
t 71848
 
5.2%
i 60753
 
4.4%
l 60223
 
4.4%
Other values (79) 424165
30.9%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 1371149
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
155158
 
11.3%
e 134096
 
9.8%
a 117965
 
8.6%
r 101199
 
7.4%
n 83052
 
6.1%
s 82888
 
6.0%
o 79802
 
5.8%
t 71848
 
5.2%
i 60753
 
4.4%
l 60223
 
4.4%
Other values (79) 424165
30.9%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 1371149
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
155158
 
11.3%
e 134096
 
9.8%
a 117965
 
8.6%
r 101199
 
7.4%
n 83052
 
6.1%
s 82888
 
6.0%
o 79802
 
5.8%
t 71848
 
5.2%
i 60753
 
4.4%
l 60223
 
4.4%
Other values (79) 424165
30.9%

samplingProtocol
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

sampleSizeValue
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

sampleSizeUnit
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

samplingEffort
Real number (ℝ)

Constant  Missing 

Distinct1
Distinct (%)100.0%
Missing1926388
Missing (%)> 99.9%
Infinite0
Infinite (%)0.0%
Mean24.1667
Minimum24.1667
Maximum24.1667
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size14.7 MiB
2025-01-02T18:29:51.328779image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Quantile statistics

Minimum24.1667
5-th percentile24.1667
Q124.1667
median24.1667
Q324.1667
95-th percentile24.1667
Maximum24.1667
Range0
Interquartile range (IQR)0

Descriptive statistics

Standard deviationnan
Coefficient of variation (CV)nan
Kurtosisnan
Mean24.1667
Median Absolute Deviation (MAD)0
Skewnessnan
Sum24.1667
Variancenan
MonotonicityStrictly increasing
2025-01-02T18:29:51.375316image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
Histogram with fixed size bins (bins=1)
ValueCountFrequency (%)
24.1667 1
 
< 0.1%
(Missing) 1926388
> 99.9%
ValueCountFrequency (%)
24.1667 1
< 0.1%
ValueCountFrequency (%)
24.1667 1
< 0.1%

fieldNotes
Real number (ℝ)

Constant  Missing 

Distinct1
Distinct (%)100.0%
Missing1926388
Missing (%)> 99.9%
Infinite0
Infinite (%)0.0%
Mean-110.283
Minimum-110.283
Maximum-110.283
Zeros0
Zeros (%)0.0%
Negative1
Negative (%)< 0.1%
Memory size14.7 MiB
2025-01-02T18:29:51.491760image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Quantile statistics

Minimum-110.283
5-th percentile-110.283
Q1-110.283
median-110.283
Q3-110.283
95-th percentile-110.283
Maximum-110.283
Range0
Interquartile range (IQR)0

Descriptive statistics

Standard deviationnan
Coefficient of variation (CV)nan
Kurtosisnan
Mean-110.283
Median Absolute Deviation (MAD)0
Skewnessnan
Sum-110.283
Variancenan
MonotonicityStrictly increasing
2025-01-02T18:29:51.541119image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
Histogram with fixed size bins (bins=1)
ValueCountFrequency (%)
-110.283 1
 
< 0.1%
(Missing) 1926388
> 99.9%
ValueCountFrequency (%)
-110.283 1
< 0.1%
ValueCountFrequency (%)
-110.283 1
< 0.1%

eventRemarks
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

locationID
Text

Missing 

Distinct94702
Distinct (%)10.0%
Missing984063
Missing (%)51.1%
Memory size14.7 MiB
2025-01-02T18:29:51.708811image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length146
Median length133
Mean length4.431840998
Min length1

Characters and Unicode

Total characters4176239
Distinct characters88
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique52903 ?
Unique (%)5.6%

Sample

1st rowE4
2nd rowNR 12-4 ID 101
3rd row23
4th row1002
5th row2059
ValueCountFrequency (%)
not 12392
 
1.2%
rec 12070
 
1.2%
4 8474
 
0.8%
rhb 7696
 
0.7%
rfb 7623
 
0.7%
1 7565
 
0.7%
2 6224
 
0.6%
3 5488
 
0.5%
gs 5168
 
0.5%
6 5011
 
0.5%
Other values (80226) 962550
92.5%
2025-01-02T18:29:51.965298image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 473501
 
11.3%
2 393285
 
9.4%
0 331166
 
7.9%
5 295493
 
7.1%
3 287149
 
6.9%
4 263595
 
6.3%
- 261708
 
6.3%
6 216159
 
5.2%
7 190483
 
4.6%
8 180376
 
4.3%
Other values (78) 1283324
30.7%

Most occurring categories

ValueCountFrequency (%)
(unknown) 4176239
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
1 473501
 
11.3%
2 393285
 
9.4%
0 331166
 
7.9%
5 295493
 
7.1%
3 287149
 
6.9%
4 263595
 
6.3%
- 261708
 
6.3%
6 216159
 
5.2%
7 190483
 
4.6%
8 180376
 
4.3%
Other values (78) 1283324
30.7%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 4176239
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
1 473501
 
11.3%
2 393285
 
9.4%
0 331166
 
7.9%
5 295493
 
7.1%
3 287149
 
6.9%
4 263595
 
6.3%
- 261708
 
6.3%
6 216159
 
5.2%
7 190483
 
4.6%
8 180376
 
4.3%
Other values (78) 1283324
30.7%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 4176239
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
1 473501
 
11.3%
2 393285
 
9.4%
0 331166
 
7.9%
5 295493
 
7.1%
3 287149
 
6.9%
4 263595
 
6.3%
- 261708
 
6.3%
6 216159
 
5.2%
7 190483
 
4.6%
8 180376
 
4.3%
Other values (78) 1283324
30.7%

higherGeographyID
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

higherGeography
Text

Missing 

Distinct12370
Distinct (%)0.7%
Missing67830
Missing (%)3.5%
Memory size14.7 MiB
2025-01-02T18:29:52.109399image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length126
Median length104
Mean length36.17341177
Min length4

Characters and Unicode

Total characters67230420
Distinct characters77
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3190 ?
Unique (%)0.2%

Sample

1st rowNorth Atlantic Ocean, United States
2nd rowNorth Atlantic Ocean, Gulf of Mexico, United States, Florida
3rd rowNorth Atlantic Ocean, Caribbean Sea, Barbados
4th rowNorth Atlantic Ocean, Gulf of Mexico, United States, Florida
5th rowPhilippines
ValueCountFrequency (%)
ocean 1259909
 
13.4%
north 1098148
 
11.7%
united 886187
 
9.4%
states 871605
 
9.3%
atlantic 718309
 
7.7%
pacific 437003
 
4.7%
mexico 248368
 
2.6%
of 243369
 
2.6%
gulf 228771
 
2.4%
south 203325
 
2.2%
Other values (4652) 3191440
34.0%
2025-01-02T18:29:52.333894image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
7527875
 
11.2%
a 6865358
 
10.2%
t 6256792
 
9.3%
i 4780189
 
7.1%
e 4733936
 
7.0%
n 4584428
 
6.8%
c 3760389
 
5.6%
o 2897124
 
4.3%
, 2857280
 
4.2%
r 2272061
 
3.4%
Other values (67) 20694988
30.8%

Most occurring categories

ValueCountFrequency (%)
(unknown) 67230420
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
7527875
 
11.2%
a 6865358
 
10.2%
t 6256792
 
9.3%
i 4780189
 
7.1%
e 4733936
 
7.0%
n 4584428
 
6.8%
c 3760389
 
5.6%
o 2897124
 
4.3%
, 2857280
 
4.2%
r 2272061
 
3.4%
Other values (67) 20694988
30.8%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 67230420
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
7527875
 
11.2%
a 6865358
 
10.2%
t 6256792
 
9.3%
i 4780189
 
7.1%
e 4733936
 
7.0%
n 4584428
 
6.8%
c 3760389
 
5.6%
o 2897124
 
4.3%
, 2857280
 
4.2%
r 2272061
 
3.4%
Other values (67) 20694988
30.8%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 67230420
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
7527875
 
11.2%
a 6865358
 
10.2%
t 6256792
 
9.3%
i 4780189
 
7.1%
e 4733936
 
7.0%
n 4584428
 
6.8%
c 3760389
 
5.6%
o 2897124
 
4.3%
, 2857280
 
4.2%
r 2272061
 
3.4%
Other values (67) 20694988
30.8%

continent
Text

Missing 

Distinct7
Distinct (%)< 0.1%
Missing1027390
Missing (%)53.3%
Memory size14.7 MiB
2025-01-02T18:29:52.408169image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length13
Median length13
Mean length9.980889856
Min length4

Characters and Unicode

Total characters8972810
Distinct characters15
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNORTH_AMERICA
2nd rowASIA
3rd rowNORTH_AMERICA
4th rowOCEANIA
5th rowNORTH_AMERICA
ValueCountFrequency (%)
north_america 475001
52.8%
oceania 155883
 
17.3%
asia 135716
 
15.1%
south_america 44254
 
4.9%
africa 39371
 
4.4%
europe 33879
 
3.8%
antarctica 14895
 
1.7%
2025-01-02T18:29:52.523357image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
A 1745135
19.4%
R 1082401
12.1%
I 865120
9.6%
C 744299
8.3%
E 742896
8.3%
O 709017
7.9%
N 645779
 
7.2%
T 549045
 
6.1%
H 519255
 
5.8%
_ 519255
 
5.8%
Other values (5) 850608
9.5%

Most occurring categories

ValueCountFrequency (%)
(unknown) 8972810
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
A 1745135
19.4%
R 1082401
12.1%
I 865120
9.6%
C 744299
8.3%
E 742896
8.3%
O 709017
7.9%
N 645779
 
7.2%
T 549045
 
6.1%
H 519255
 
5.8%
_ 519255
 
5.8%
Other values (5) 850608
9.5%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 8972810
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
A 1745135
19.4%
R 1082401
12.1%
I 865120
9.6%
C 744299
8.3%
E 742896
8.3%
O 709017
7.9%
N 645779
 
7.2%
T 549045
 
6.1%
H 519255
 
5.8%
_ 519255
 
5.8%
Other values (5) 850608
9.5%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 8972810
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
A 1745135
19.4%
R 1082401
12.1%
I 865120
9.6%
C 744299
8.3%
E 742896
8.3%
O 709017
7.9%
N 645779
 
7.2%
T 549045
 
6.1%
H 519255
 
5.8%
_ 519255
 
5.8%
Other values (5) 850608
9.5%

waterBody
Text

Missing 

Distinct1655
Distinct (%)0.1%
Missing666647
Missing (%)34.6%
Memory size14.7 MiB
2025-01-02T18:29:52.593537image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length76
Median length75
Mean length24.49184833
Min length7

Characters and Unicode

Total characters30853410
Distinct characters63
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique510 ?
Unique (%)< 0.1%

Sample

1st rowNorth Atlantic Ocean
2nd rowNorth Atlantic Ocean, Gulf of Mexico
3rd rowNorth Atlantic Ocean, Caribbean Sea
4th rowNorth Atlantic Ocean, Gulf of Mexico
5th rowAntarctic Ocean
ValueCountFrequency (%)
ocean 1259434
26.1%
north 998553
20.7%
atlantic 718247
14.9%
pacific 436962
 
9.1%
of 231313
 
4.8%
gulf 228638
 
4.7%
sea 193896
 
4.0%
mexico 187756
 
3.9%
south 160377
 
3.3%
caribbean 89358
 
1.9%
Other values (1319) 318010
 
6.6%
2025-01-02T18:29:52.740786image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3562802
11.5%
c 3175906
10.3%
a 3113538
 
10.1%
t 2738941
 
8.9%
n 2331622
 
7.6%
i 2082746
 
6.8%
e 1823700
 
5.9%
o 1648330
 
5.3%
O 1261125
 
4.1%
r 1218140
 
3.9%
Other values (53) 7896560
25.6%

Most occurring categories

ValueCountFrequency (%)
(unknown) 30853410
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
3562802
11.5%
c 3175906
10.3%
a 3113538
 
10.1%
t 2738941
 
8.9%
n 2331622
 
7.6%
i 2082746
 
6.8%
e 1823700
 
5.9%
o 1648330
 
5.3%
O 1261125
 
4.1%
r 1218140
 
3.9%
Other values (53) 7896560
25.6%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 30853410
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
3562802
11.5%
c 3175906
10.3%
a 3113538
 
10.1%
t 2738941
 
8.9%
n 2331622
 
7.6%
i 2082746
 
6.8%
e 1823700
 
5.9%
o 1648330
 
5.3%
O 1261125
 
4.1%
r 1218140
 
3.9%
Other values (53) 7896560
25.6%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 30853410
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
3562802
11.5%
c 3175906
10.3%
a 3113538
 
10.1%
t 2738941
 
8.9%
n 2331622
 
7.6%
i 2082746
 
6.8%
e 1823700
 
5.9%
o 1648330
 
5.3%
O 1261125
 
4.1%
r 1218140
 
3.9%
Other values (53) 7896560
25.6%

islandGroup
Text

Missing 

Distinct20
Distinct (%)2.6%
Missing1925619
Missing (%)> 99.9%
Memory size14.7 MiB
2025-01-02T18:29:52.810464image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length17
Median length15
Mean length14.52857143
Min length5

Characters and Unicode

Total characters11187
Distinct characters35
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6 ?
Unique (%)0.8%

Sample

1st rowSociety Islands
2nd rowSociety Islands
3rd rowSociety Islands
4th rowSociety Islands
5th rowSociety Islands
ValueCountFrequency (%)
islands 707
47.0%
society 679
45.2%
exuma 20
 
1.3%
south 12
 
0.8%
sandwich 12
 
0.8%
florida 10
 
0.7%
keys 10
 
0.7%
pacific 10
 
0.7%
carolina 8
 
0.5%
aleutian 7
 
0.5%
Other values (14) 28
 
1.9%
2025-01-02T18:29:52.943496image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
s 1446
12.9%
a 803
 
7.2%
l 751
 
6.7%
n 748
 
6.7%
i 743
 
6.6%
d 738
 
6.6%
733
 
6.6%
o 722
 
6.5%
c 713
 
6.4%
e 711
 
6.4%
Other values (25) 3079
27.5%

Most occurring categories

ValueCountFrequency (%)
(unknown) 11187
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
s 1446
12.9%
a 803
 
7.2%
l 751
 
6.7%
n 748
 
6.7%
i 743
 
6.6%
d 738
 
6.6%
733
 
6.6%
o 722
 
6.5%
c 713
 
6.4%
e 711
 
6.4%
Other values (25) 3079
27.5%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 11187
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
s 1446
12.9%
a 803
 
7.2%
l 751
 
6.7%
n 748
 
6.7%
i 743
 
6.6%
d 738
 
6.6%
733
 
6.6%
o 722
 
6.5%
c 713
 
6.4%
e 711
 
6.4%
Other values (25) 3079
27.5%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 11187
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
s 1446
12.9%
a 803
 
7.2%
l 751
 
6.7%
n 748
 
6.7%
i 743
 
6.6%
d 738
 
6.6%
733
 
6.6%
o 722
 
6.5%
c 713
 
6.4%
e 711
 
6.4%
Other values (25) 3079
27.5%

island
Text

Missing 

Distinct58
Distinct (%)5.9%
Missing1925411
Missing (%)99.9%
Memory size14.7 MiB
2025-01-02T18:29:53.046747image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length25
Median length6
Mean length6.676891616
Min length4

Characters and Unicode

Total characters6530
Distinct characters49
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique33 ?
Unique (%)3.4%

Sample

1st rowMoorea
2nd rowMoorea
3rd rowShikoku
4th rowOahu
5th rowMoorea
ValueCountFrequency (%)
moorea 674
60.4%
oahu 147
 
13.2%
island 91
 
8.2%
great 20
 
1.8%
exuma 20
 
1.8%
eniwetok 13
 
1.2%
nunivak 13
 
1.2%
bonaire 11
 
1.0%
key 10
 
0.9%
west 10
 
0.9%
Other values (58) 106
 
9.5%
2025-01-02T18:29:53.199208image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
o 1430
21.9%
a 1060
16.2%
e 771
11.8%
r 737
11.3%
M 683
10.5%
u 225
 
3.4%
n 186
 
2.8%
h 170
 
2.6%
O 154
 
2.4%
137
 
2.1%
Other values (39) 977
15.0%

Most occurring categories

ValueCountFrequency (%)
(unknown) 6530
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
o 1430
21.9%
a 1060
16.2%
e 771
11.8%
r 737
11.3%
M 683
10.5%
u 225
 
3.4%
n 186
 
2.8%
h 170
 
2.6%
O 154
 
2.4%
137
 
2.1%
Other values (39) 977
15.0%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 6530
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
o 1430
21.9%
a 1060
16.2%
e 771
11.8%
r 737
11.3%
M 683
10.5%
u 225
 
3.4%
n 186
 
2.8%
h 170
 
2.6%
O 154
 
2.4%
137
 
2.1%
Other values (39) 977
15.0%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 6530
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
o 1430
21.9%
a 1060
16.2%
e 771
11.8%
r 737
11.3%
M 683
10.5%
u 225
 
3.4%
n 186
 
2.8%
h 170
 
2.6%
O 154
 
2.4%
137
 
2.1%
Other values (39) 977
15.0%

countryCode
Text

Missing 

Distinct239
Distinct (%)< 0.1%
Missing110758
Missing (%)5.7%
Memory size14.7 MiB
2025-01-02T18:29:53.354896image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length2
Median length2
Mean length2
Min length2

Characters and Unicode

Total characters3631262
Distinct characters26
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3 ?
Unique (%)< 0.1%

Sample

1st rowUS
2nd rowUS
3rd rowBB
4th rowUS
5th rowPH
ValueCountFrequency (%)
us 868580
47.8%
ph 93802
 
5.2%
mx 59371
 
3.3%
pa 46369
 
2.6%
aq 44802
 
2.5%
jp 38538
 
2.1%
cu 30147
 
1.7%
ca 28674
 
1.6%
jm 27586
 
1.5%
pf 27226
 
1.5%
Other values (229) 550536
30.3%
2025-01-02T18:29:53.575536image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
U 948158
26.1%
S 926973
25.5%
P 250779
 
6.9%
A 177982
 
4.9%
M 160911
 
4.4%
H 143259
 
3.9%
C 133182
 
3.7%
B 95322
 
2.6%
J 78390
 
2.2%
G 66596
 
1.8%
Other values (16) 649710
17.9%

Most occurring categories

ValueCountFrequency (%)
(unknown) 3631262
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
U 948158
26.1%
S 926973
25.5%
P 250779
 
6.9%
A 177982
 
4.9%
M 160911
 
4.4%
H 143259
 
3.9%
C 133182
 
3.7%
B 95322
 
2.6%
J 78390
 
2.2%
G 66596
 
1.8%
Other values (16) 649710
17.9%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 3631262
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
U 948158
26.1%
S 926973
25.5%
P 250779
 
6.9%
A 177982
 
4.9%
M 160911
 
4.4%
H 143259
 
3.9%
C 133182
 
3.7%
B 95322
 
2.6%
J 78390
 
2.2%
G 66596
 
1.8%
Other values (16) 649710
17.9%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 3631262
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
U 948158
26.1%
S 926973
25.5%
P 250779
 
6.9%
A 177982
 
4.9%
M 160911
 
4.4%
H 143259
 
3.9%
C 133182
 
3.7%
B 95322
 
2.6%
J 78390
 
2.2%
G 66596
 
1.8%
Other values (16) 649710
17.9%

stateProvince
Text

Missing 

Distinct1326
Distinct (%)0.1%
Missing943672
Missing (%)49.0%
Memory size14.7 MiB
2025-01-02T18:29:53.748578image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length51
Median length39
Mean length9.182681281
Min length3

Characters and Unicode

Total characters9023977
Distinct characters70
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique281 ?
Unique (%)< 0.1%

Sample

1st rowFlorida
2nd rowFlorida
3rd rowMassachusetts
4th rowQuezon
5th rowNewfoundland
ValueCountFrequency (%)
florida 157981
 
13.1%
massachusetts 103383
 
8.6%
california 57085
 
4.7%
carolina 53929
 
4.5%
texas 43591
 
3.6%
alaska 41859
 
3.5%
north 31994
 
2.7%
louisiana 28645
 
2.4%
hawaii 26401
 
2.2%
south 26211
 
2.2%
Other values (1250) 635016
52.7%
2025-01-02T18:29:53.992421image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
a 1427947
15.8%
i 809012
 
9.0%
s 773253
 
8.6%
o 650880
 
7.2%
r 519439
 
5.8%
l 506659
 
5.6%
n 498663
 
5.5%
e 457617
 
5.1%
t 400633
 
4.4%
u 277325
 
3.1%
Other values (60) 2702549
29.9%

Most occurring categories

ValueCountFrequency (%)
(unknown) 9023977
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
a 1427947
15.8%
i 809012
 
9.0%
s 773253
 
8.6%
o 650880
 
7.2%
r 519439
 
5.8%
l 506659
 
5.6%
n 498663
 
5.5%
e 457617
 
5.1%
t 400633
 
4.4%
u 277325
 
3.1%
Other values (60) 2702549
29.9%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 9023977
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
a 1427947
15.8%
i 809012
 
9.0%
s 773253
 
8.6%
o 650880
 
7.2%
r 519439
 
5.8%
l 506659
 
5.6%
n 498663
 
5.5%
e 457617
 
5.1%
t 400633
 
4.4%
u 277325
 
3.1%
Other values (60) 2702549
29.9%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 9023977
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
a 1427947
15.8%
i 809012
 
9.0%
s 773253
 
8.6%
o 650880
 
7.2%
r 519439
 
5.8%
l 506659
 
5.6%
n 498663
 
5.5%
e 457617
 
5.1%
t 400633
 
4.4%
u 277325
 
3.1%
Other values (60) 2702549
29.9%

county
Text

Missing 

Distinct2594
Distinct (%)1.9%
Missing1786419
Missing (%)92.7%
Memory size14.7 MiB
2025-01-02T18:29:54.145536image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length46
Median length43
Mean length14.35976281
Min length3

Characters and Unicode

Total characters2009936
Distinct characters65
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique558 ?
Unique (%)0.4%

Sample

1st rowCumberland County
2nd rowAllamakee County
3rd rowSt. Lucie County
4th rowDelaware County
5th rowKimble County
ValueCountFrequency (%)
county 135420
45.4%
st 3893
 
1.3%
parish 3203
 
1.1%
monroe 3117
 
1.0%
lucie 2649
 
0.9%
montgomery 2553
 
0.9%
san 2117
 
0.7%
prince 1875
 
0.6%
george's 1763
 
0.6%
jackson 1748
 
0.6%
Other values (2256) 139873
46.9%
2025-01-02T18:29:54.366934image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
n 223764
11.1%
o 216841
10.8%
t 181044
 
9.0%
u 160921
 
8.0%
158241
 
7.9%
C 152411
 
7.6%
y 151816
 
7.6%
e 105732
 
5.3%
a 103264
 
5.1%
r 74021
 
3.7%
Other values (55) 481881
24.0%

Most occurring categories

ValueCountFrequency (%)
(unknown) 2009936
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
n 223764
11.1%
o 216841
10.8%
t 181044
 
9.0%
u 160921
 
8.0%
158241
 
7.9%
C 152411
 
7.6%
y 151816
 
7.6%
e 105732
 
5.3%
a 103264
 
5.1%
r 74021
 
3.7%
Other values (55) 481881
24.0%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 2009936
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
n 223764
11.1%
o 216841
10.8%
t 181044
 
9.0%
u 160921
 
8.0%
158241
 
7.9%
C 152411
 
7.6%
y 151816
 
7.6%
e 105732
 
5.3%
a 103264
 
5.1%
r 74021
 
3.7%
Other values (55) 481881
24.0%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 2009936
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
n 223764
11.1%
o 216841
10.8%
t 181044
 
9.0%
u 160921
 
8.0%
158241
 
7.9%
C 152411
 
7.6%
y 151816
 
7.6%
e 105732
 
5.3%
a 103264
 
5.1%
r 74021
 
3.7%
Other values (55) 481881
24.0%

municipality
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

locality
Text

Missing 

Distinct204737
Distinct (%)15.9%
Missing642385
Missing (%)33.3%
Memory size14.7 MiB
2025-01-02T18:29:54.524734image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length588
Median length368
Mean length28.96689185
Min length1

Characters and Unicode

Total characters37193605
Distinct characters137
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique126312 ?
Unique (%)9.8%

Sample

1st rowoff Delaware
2nd rowW Coast
3rd rowCape Sable, West Of
4th rowAntarctic Peninsula
5th rowGeorges Bank
ValueCountFrequency (%)
island 342350
 
5.6%
of 336413
 
5.5%
off 252659
 
4.1%
bay 137529
 
2.2%
islands 98145
 
1.6%
bank 84596
 
1.4%
south 74617
 
1.2%
georges 66662
 
1.1%
florida 63422
 
1.0%
river 63368
 
1.0%
Other values (76623) 4632728
75.3%
2025-01-02T18:29:54.766147image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4868486
 
13.1%
a 3496721
 
9.4%
e 2450119
 
6.6%
o 2295753
 
6.2%
n 2154016
 
5.8%
r 1673799
 
4.5%
s 1628291
 
4.4%
i 1596676
 
4.3%
l 1583563
 
4.3%
t 1474946
 
4.0%
Other values (127) 13971235
37.6%

Most occurring categories

ValueCountFrequency (%)
(unknown) 37193605
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
4868486
 
13.1%
a 3496721
 
9.4%
e 2450119
 
6.6%
o 2295753
 
6.2%
n 2154016
 
5.8%
r 1673799
 
4.5%
s 1628291
 
4.4%
i 1596676
 
4.3%
l 1583563
 
4.3%
t 1474946
 
4.0%
Other values (127) 13971235
37.6%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 37193605
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
4868486
 
13.1%
a 3496721
 
9.4%
e 2450119
 
6.6%
o 2295753
 
6.2%
n 2154016
 
5.8%
r 1673799
 
4.5%
s 1628291
 
4.4%
i 1596676
 
4.3%
l 1583563
 
4.3%
t 1474946
 
4.0%
Other values (127) 13971235
37.6%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 37193605
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
4868486
 
13.1%
a 3496721
 
9.4%
e 2450119
 
6.6%
o 2295753
 
6.2%
n 2154016
 
5.8%
r 1673799
 
4.5%
s 1628291
 
4.4%
i 1596676
 
4.3%
l 1583563
 
4.3%
t 1474946
 
4.0%
Other values (127) 13971235
37.6%

verbatimLocality
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

verbatimElevation
Unsupported

Missing  Rejected  Unsupported 

Missing1925927
Missing (%)> 99.9%
Memory size14.7 MiB

verticalDatum
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

verbatimDepth
Text

Missing 

Distinct1530
Distinct (%)5.8%
Missing1900145
Missing (%)98.6%
Memory size14.7 MiB
2025-01-02T18:29:54.949398image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length99
Median length91
Mean length13.43716659
Min length1

Characters and Unicode

Total characters352645
Distinct characters79
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique721 ?
Unique (%)2.7%

Sample

1st rowSurface
2nd rowmax depth 1772 ft
3rd rowsurface
4th rowIntertidal
5th rowIntertidal
ValueCountFrequency (%)
intertidal 11932
23.4%
surface 4085
 
8.0%
recorded 2871
 
5.6%
depths 2850
 
5.6%
multiple 2846
 
5.6%
shore 1165
 
2.3%
0-300 1120
 
2.2%
0 1069
 
2.1%
depth 1023
 
2.0%
low 964
 
1.9%
Other values (1043) 21003
41.2%
2025-01-02T18:29:55.204159image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
t 36687
 
10.4%
e 35142
 
10.0%
r 25391
 
7.2%
24684
 
7.0%
d 24177
 
6.9%
l 20651
 
5.9%
a 20481
 
5.8%
i 19392
 
5.5%
0 16029
 
4.5%
n 14727
 
4.2%
Other values (69) 115284
32.7%

Most occurring categories

ValueCountFrequency (%)
(unknown) 352645
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
t 36687
 
10.4%
e 35142
 
10.0%
r 25391
 
7.2%
24684
 
7.0%
d 24177
 
6.9%
l 20651
 
5.9%
a 20481
 
5.8%
i 19392
 
5.5%
0 16029
 
4.5%
n 14727
 
4.2%
Other values (69) 115284
32.7%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 352645
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
t 36687
 
10.4%
e 35142
 
10.0%
r 25391
 
7.2%
24684
 
7.0%
d 24177
 
6.9%
l 20651
 
5.9%
a 20481
 
5.8%
i 19392
 
5.5%
0 16029
 
4.5%
n 14727
 
4.2%
Other values (69) 115284
32.7%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 352645
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
t 36687
 
10.4%
e 35142
 
10.0%
r 25391
 
7.2%
24684
 
7.0%
d 24177
 
6.9%
l 20651
 
5.9%
a 20481
 
5.8%
i 19392
 
5.5%
0 16029
 
4.5%
n 14727
 
4.2%
Other values (69) 115284
32.7%

minimumDistanceAboveSurfaceInMeters
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

maximumDistanceAboveSurfaceInMeters
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

locationAccordingTo
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

locationRemarks
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

decimalLatitude
Real number (ℝ)

Missing 

Distinct70087
Distinct (%)7.0%
Missing927342
Missing (%)48.1%
Infinite0
Infinite (%)0.0%
Mean19.53293445
Minimum-90
Maximum86.85
Zeros126
Zeros (%)< 0.1%
Negative152585
Negative (%)7.9%
Memory size14.7 MiB
2025-01-02T18:29:55.417319image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Quantile statistics

Minimum-90
5-th percentile-56.108
Q111.5
median27.8867
Q338.2881
95-th percentile45.75397
Maximum86.85
Range176.85
Interquartile range (IQR)26.7881

Descriptive statistics

Standard deviation28.76997469
Coefficient of variation (CV)1.472895676
Kurtosis2.539179044
Mean19.53293445
Median Absolute Deviation (MAD)11.395
Skewness-1.700199564
Sum19514319.57
Variance827.7114436
MonotonicityNot monotonic
2025-01-02T18:29:55.482361image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
25.58 10487
 
0.5%
40.6583 8821
 
0.5%
26.17 7319
 
0.4%
26.5 5153
 
0.3%
26.97 3942
 
0.2%
25.7883 3457
 
0.2%
9.4 3081
 
0.2%
40.895 2590
 
0.1%
40.66 2520
 
0.1%
25.2967 2475
 
0.1%
Other values (70077) 949202
49.3%
(Missing) 927342
48.1%
ValueCountFrequency (%)
-90 1
 
< 0.1%
-88.983 1
 
< 0.1%
-87.55 3
 
< 0.1%
-82.375 11
< 0.1%
-78.9167 3
 
< 0.1%
ValueCountFrequency (%)
86.85 1
 
< 0.1%
86.618 1
 
< 0.1%
85.9733 4
< 0.1%
85.9583 5
< 0.1%
85.6183 1
 
< 0.1%

decimalLongitude
Real number (ℝ)

Missing 

Distinct74625
Distinct (%)7.5%
Missing927342
Missing (%)48.1%
Infinite0
Infinite (%)0.0%
Mean-49.51288993
Minimum-180
Maximum180
Zeros13
Zeros (%)< 0.1%
Negative826266
Negative (%)42.9%
Memory size14.7 MiB
2025-01-02T18:29:55.548919image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Quantile statistics

Minimum-180
5-th percentile-132.4
Q1-86.0583
median-77.2831
Q3-64.1125
95-th percentile134.07
Maximum180
Range360
Interquartile range (IQR)21.9458

Descriptive statistics

Standard deviation81.34966635
Coefficient of variation (CV)-1.642999762
Kurtosis1.251841924
Mean-49.51288993
Median Absolute Deviation (MAD)10.5081
Skewness1.527721701
Sum-49465704.14
Variance6617.768215
MonotonicityNot monotonic
2025-01-02T18:29:55.615234image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
-80.1 10464
 
0.5%
127.848 4532
 
0.2%
-67.7683 4213
 
0.2%
-80.13 3736
 
0.2%
-82.7 3516
 
0.2%
-67.77 2821
 
0.1%
-66.775 2592
 
0.1%
-81.6633 2462
 
0.1%
-70.6731 2397
 
0.1%
-67.755 2356
 
0.1%
Other values (74615) 959958
49.8%
(Missing) 927342
48.1%
ValueCountFrequency (%)
-180 8
 
< 0.1%
-179.994 2
 
< 0.1%
-179.98 11
< 0.1%
-179.971 1
 
< 0.1%
-179.97 26
< 0.1%
ValueCountFrequency (%)
180 27
< 0.1%
179.994 1
 
< 0.1%
179.98 16
< 0.1%
179.977 1
 
< 0.1%
179.954 1
 
< 0.1%

coordinateUncertaintyInMeters
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

coordinatePrecision
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

pointRadiusSpatialFit
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB
Distinct9
Distinct (%)< 0.1%
Missing1246881
Missing (%)64.7%
Memory size14.7 MiB
2025-01-02T18:29:55.685747image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length23
Median length23
Mean length22.60567057
Min length3

Characters and Unicode

Total characters15360734
Distinct characters30
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st rowDegrees Minutes Seconds
2nd rowDegrees Minutes Seconds
3rd rowDegrees Minutes Seconds
4th rowDegrees Minutes Seconds
5th rowDegrees Minutes Seconds
ValueCountFrequency (%)
degrees 670900
33.4%
minutes 648195
32.3%
seconds 648195
32.3%
decimal 22705
 
1.1%
township 7004
 
0.3%
range 7004
 
0.3%
marsden 605
 
< 0.1%
square 605
 
< 0.1%
unknown 532
 
< 0.1%
utm 464
 
< 0.1%
Other values (3) 6
 
< 0.1%
2025-01-02T18:29:55.816926image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
e 3340010
21.7%
s 1974899
12.9%
1326707
 
8.6%
n 1312599
 
8.5%
i 677904
 
4.4%
g 677904
 
4.4%
r 672113
 
4.4%
d 671463
 
4.4%
D 670945
 
4.4%
c 670901
 
4.4%
Other values (20) 3365289
21.9%

Most occurring categories

ValueCountFrequency (%)
(unknown) 15360734
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
e 3340010
21.7%
s 1974899
12.9%
1326707
 
8.6%
n 1312599
 
8.5%
i 677904
 
4.4%
g 677904
 
4.4%
r 672113
 
4.4%
d 671463
 
4.4%
D 670945
 
4.4%
c 670901
 
4.4%
Other values (20) 3365289
21.9%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 15360734
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
e 3340010
21.7%
s 1974899
12.9%
1326707
 
8.6%
n 1312599
 
8.5%
i 677904
 
4.4%
g 677904
 
4.4%
r 672113
 
4.4%
d 671463
 
4.4%
D 670945
 
4.4%
c 670901
 
4.4%
Other values (20) 3365289
21.9%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 15360734
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
e 3340010
21.7%
s 1974899
12.9%
1326707
 
8.6%
n 1312599
 
8.5%
i 677904
 
4.4%
g 677904
 
4.4%
r 672113
 
4.4%
d 671463
 
4.4%
D 670945
 
4.4%
c 670901
 
4.4%
Other values (20) 3365289
21.9%

verbatimSRS
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

footprintWKT
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

footprintSRS
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

footprintSpatialFit
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

georeferencedBy
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

georeferencedDate
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

georeferenceProtocol
Text

Missing 

Distinct112
Distinct (%)< 0.1%
Missing1265789
Missing (%)65.7%
Memory size14.7 MiB
2025-01-02T18:29:55.918048image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length87
Median length20
Mean length20.10029973
Min length3

Characters and Unicode

Total characters13278258
Distinct characters64
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique20 ?
Unique (%)< 0.1%

Sample

1st rowunknown, from legacy
2nd rowunknown, from legacy
3rd rowunknown, from legacy
4th rowunknown, from legacy
5th rowunknown, from legacy
ValueCountFrequency (%)
from 509060
26.2%
unknown 507577
26.1%
legacy 505126
26.0%
geolocate 70310
 
3.6%
names 41937
 
2.2%
geographic 41556
 
2.1%
of 35279
 
1.8%
getty 34687
 
1.8%
thesaurus 34686
 
1.8%
may 23191
 
1.2%
Other values (124) 141515
 
7.3%
2025-01-02T18:29:56.146592image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
n 1560806
 
11.8%
1284324
 
9.7%
o 1253392
 
9.4%
e 822045
 
6.2%
a 797024
 
6.0%
r 642024
 
4.8%
c 624646
 
4.7%
g 591299
 
4.5%
u 580748
 
4.4%
y 577424
 
4.3%
Other values (54) 4544526
34.2%

Most occurring categories

ValueCountFrequency (%)
(unknown) 13278258
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
n 1560806
 
11.8%
1284324
 
9.7%
o 1253392
 
9.4%
e 822045
 
6.2%
a 797024
 
6.0%
r 642024
 
4.8%
c 624646
 
4.7%
g 591299
 
4.5%
u 580748
 
4.4%
y 577424
 
4.3%
Other values (54) 4544526
34.2%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 13278258
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
n 1560806
 
11.8%
1284324
 
9.7%
o 1253392
 
9.4%
e 822045
 
6.2%
a 797024
 
6.0%
r 642024
 
4.8%
c 624646
 
4.7%
g 591299
 
4.5%
u 580748
 
4.4%
y 577424
 
4.3%
Other values (54) 4544526
34.2%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 13278258
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
n 1560806
 
11.8%
1284324
 
9.7%
o 1253392
 
9.4%
e 822045
 
6.2%
a 797024
 
6.0%
r 642024
 
4.8%
c 624646
 
4.7%
g 591299
 
4.5%
u 580748
 
4.4%
y 577424
 
4.3%
Other values (54) 4544526
34.2%

georeferenceSources
Text

Constant  Missing 

Distinct1
Distinct (%)50.0%
Missing1926387
Missing (%)> 99.9%
Memory size14.7 MiB
2025-01-02T18:29:56.209536image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length8
Median length8
Mean length8
Min length8

Characters and Unicode

Total characters16
Distinct characters6
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowPARATYPE
2nd rowPARATYPE
ValueCountFrequency (%)
paratype 2
100.0%
2025-01-02T18:29:56.321800image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
P 4
25.0%
A 4
25.0%
R 2
12.5%
T 2
12.5%
Y 2
12.5%
E 2
12.5%

Most occurring categories

ValueCountFrequency (%)
(unknown) 16
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
P 4
25.0%
A 4
25.0%
R 2
12.5%
T 2
12.5%
Y 2
12.5%
E 2
12.5%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 16
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
P 4
25.0%
A 4
25.0%
R 2
12.5%
T 2
12.5%
Y 2
12.5%
E 2
12.5%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 16
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
P 4
25.0%
A 4
25.0%
R 2
12.5%
T 2
12.5%
Y 2
12.5%
E 2
12.5%

georeferenceRemarks
Text

Missing 

Distinct4822
Distinct (%)15.9%
Missing1896101
Missing (%)98.4%
Memory size14.7 MiB
2025-01-02T18:29:56.466843image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length122
Median length118
Mean length23.03717644
Min length1

Characters and Unicode

Total characters697750
Distinct characters78
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3165 ?
Unique (%)10.4%

Sample

1st rowExtended About 16 Km Offshore From Crystal River Power Plant
2nd row0.8 mile west of Montgomery-Polk county line, north side of
3rd rowSan Andreas Fault
4th row6 Mile W Of Watsonville
5th rowfrom Holt data card
ValueCountFrequency (%)
approximate 9789
 
8.9%
from 6478
 
5.9%
river 3464
 
3.2%
of 3097
 
2.8%
about 3076
 
2.8%
16 2974
 
2.7%
km 2970
 
2.7%
plant 2933
 
2.7%
offshore 2929
 
2.7%
power 2929
 
2.7%
Other values (4971) 68760
62.9%
2025-01-02T18:29:56.706667image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
79111
 
11.3%
a 60517
 
8.7%
e 55652
 
8.0%
o 49194
 
7.1%
r 47507
 
6.8%
t 40249
 
5.8%
i 29470
 
4.2%
n 26681
 
3.8%
p 24672
 
3.5%
m 24234
 
3.5%
Other values (68) 260463
37.3%

Most occurring categories

ValueCountFrequency (%)
(unknown) 697750
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
79111
 
11.3%
a 60517
 
8.7%
e 55652
 
8.0%
o 49194
 
7.1%
r 47507
 
6.8%
t 40249
 
5.8%
i 29470
 
4.2%
n 26681
 
3.8%
p 24672
 
3.5%
m 24234
 
3.5%
Other values (68) 260463
37.3%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 697750
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
79111
 
11.3%
a 60517
 
8.7%
e 55652
 
8.0%
o 49194
 
7.1%
r 47507
 
6.8%
t 40249
 
5.8%
i 29470
 
4.2%
n 26681
 
3.8%
p 24672
 
3.5%
m 24234
 
3.5%
Other values (68) 260463
37.3%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 697750
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
79111
 
11.3%
a 60517
 
8.7%
e 55652
 
8.0%
o 49194
 
7.1%
r 47507
 
6.8%
t 40249
 
5.8%
i 29470
 
4.2%
n 26681
 
3.8%
p 24672
 
3.5%
m 24234
 
3.5%
Other values (68) 260463
37.3%

geologicalContextID
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

earliestEonOrLowestEonothem
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

latestEonOrHighestEonothem
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

earliestEraOrLowestErathem
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

latestEraOrHighestErathem
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

earliestPeriodOrLowestSystem
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

latestPeriodOrHighestSystem
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

earliestEpochOrLowestSeries
Real number (ℝ)

Missing 

Distinct2
Distinct (%)100.0%
Missing1926387
Missing (%)> 99.9%
Infinite0
Infinite (%)0.0%
Mean4493591.5
Minimum2504455
Maximum6482728
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size14.7 MiB
2025-01-02T18:29:56.764606image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Quantile statistics

Minimum2504455
5-th percentile2703368.65
Q13499023.25
median4493591.5
Q35488159.75
95-th percentile6283814.35
Maximum6482728
Range3978273
Interquartile range (IQR)1989136.5

Descriptive statistics

Standard deviation2813063.816
Coefficient of variation (CV)0.6260168099
Kurtosisnan
Mean4493591.5
Median Absolute Deviation (MAD)1989136.5
Skewnessnan
Sum8987183
Variance7.913328031 × 1012
MonotonicityStrictly decreasing
2025-01-02T18:29:56.815826image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
Histogram with fixed size bins (bins=2)
ValueCountFrequency (%)
6482728 1
 
< 0.1%
2504455 1
 
< 0.1%
(Missing) 1926387
> 99.9%
ValueCountFrequency (%)
2504455 1
< 0.1%
6482728 1
< 0.1%
ValueCountFrequency (%)
6482728 1
< 0.1%
2504455 1
< 0.1%

latestEpochOrHighestSeries
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

earliestAgeOrLowestStage
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

latestAgeOrHighestStage
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

lowestBiostratigraphicZone
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

highestBiostratigraphicZone
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB
Distinct2
Distinct (%)100.0%
Missing1926387
Missing (%)> 99.9%
Memory size14.7 MiB
2025-01-02T18:29:56.881137image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length46
Median length44
Mean length44
Min length42

Characters and Unicode

Total characters88
Distinct characters32
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2 ?
Unique (%)100.0%

Sample

1st rowHemionchos striatus Campbell & Beveridge, 2006
2nd rowConspicuum icteridorum Denton & Byrd, 1951
ValueCountFrequency (%)
2
16.7%
striatus 1
8.3%
hemionchos 1
8.3%
campbell 1
8.3%
beveridge 1
8.3%
2006 1
8.3%
conspicuum 1
8.3%
icteridorum 1
8.3%
denton 1
8.3%
byrd 1
8.3%
2025-01-02T18:29:57.013685image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
10
 
11.4%
e 7
 
8.0%
i 6
 
6.8%
o 5
 
5.7%
r 5
 
5.7%
s 4
 
4.5%
n 4
 
4.5%
m 4
 
4.5%
u 4
 
4.5%
t 4
 
4.5%
Other values (22) 35
39.8%

Most occurring categories

ValueCountFrequency (%)
(unknown) 88
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
10
 
11.4%
e 7
 
8.0%
i 6
 
6.8%
o 5
 
5.7%
r 5
 
5.7%
s 4
 
4.5%
n 4
 
4.5%
m 4
 
4.5%
u 4
 
4.5%
t 4
 
4.5%
Other values (22) 35
39.8%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 88
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
10
 
11.4%
e 7
 
8.0%
i 6
 
6.8%
o 5
 
5.7%
r 5
 
5.7%
s 4
 
4.5%
n 4
 
4.5%
m 4
 
4.5%
u 4
 
4.5%
t 4
 
4.5%
Other values (22) 35
39.8%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 88
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
10
 
11.4%
e 7
 
8.0%
i 6
 
6.8%
o 5
 
5.7%
r 5
 
5.7%
s 4
 
4.5%
n 4
 
4.5%
m 4
 
4.5%
u 4
 
4.5%
t 4
 
4.5%
Other values (22) 35
39.8%

group
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

formation
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

member
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

bed
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

identificationID
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

verbatimIdentification
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB
Distinct7
Distinct (%)< 0.1%
Missing1908256
Missing (%)99.1%
Memory size14.7 MiB
2025-01-02T18:29:57.066409image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length76
Median length3
Mean length3.553796945
Min length3

Characters and Unicode

Total characters64441
Distinct characters26
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2 ?
Unique (%)< 0.1%

Sample

1st rowcf.
2nd rowcf.
3rd rowuncertain
4th rowcf.
5th rowcf.
ValueCountFrequency (%)
cf 15638
86.2%
uncertain 1489
 
8.2%
aff 600
 
3.3%
near 404
 
2.2%
animalia 2
 
< 0.1%
platyhelminthes 2
 
< 0.1%
cestoda 1
 
< 0.1%
trematoda 1
 
< 0.1%
digenea 1
 
< 0.1%
plagiorchiida 1
 
< 0.1%
2025-01-02T18:29:57.180703image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
c 17130
26.6%
f 16838
26.1%
. 16238
25.2%
n 3387
 
5.3%
a 2506
 
3.9%
e 1903
 
3.0%
r 1896
 
2.9%
i 1502
 
2.3%
t 1495
 
2.3%
u 1487
 
2.3%
Other values (16) 59
 
0.1%

Most occurring categories

ValueCountFrequency (%)
(unknown) 64441
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
c 17130
26.6%
f 16838
26.1%
. 16238
25.2%
n 3387
 
5.3%
a 2506
 
3.9%
e 1903
 
3.0%
r 1896
 
2.9%
i 1502
 
2.3%
t 1495
 
2.3%
u 1487
 
2.3%
Other values (16) 59
 
0.1%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 64441
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
c 17130
26.6%
f 16838
26.1%
. 16238
25.2%
n 3387
 
5.3%
a 2506
 
3.9%
e 1903
 
3.0%
r 1896
 
2.9%
i 1502
 
2.3%
t 1495
 
2.3%
u 1487
 
2.3%
Other values (16) 59
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 64441
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
c 17130
26.6%
f 16838
26.1%
. 16238
25.2%
n 3387
 
5.3%
a 2506
 
3.9%
e 1903
 
3.0%
r 1896
 
2.9%
i 1502
 
2.3%
t 1495
 
2.3%
u 1487
 
2.3%
Other values (16) 59
 
0.1%

typeStatus
Text

Missing 

Distinct11
Distinct (%)< 0.1%
Missing1841062
Missing (%)95.6%
Memory size14.7 MiB
2025-01-02T18:29:57.242328image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length13
Median length8
Mean length7.724987401
Min length4

Characters and Unicode

Total characters659150
Distinct characters17
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowPARATYPE
2nd rowHOLOTYPE
3rd rowPARATYPE
4th rowHOLOTYPE
5th rowPARATYPE
ValueCountFrequency (%)
paratype 40578
47.6%
holotype 25358
29.7%
syntype 9555
 
11.2%
type 4807
 
5.6%
allotype 2818
 
3.3%
lectotype 862
 
1.0%
paralectotype 795
 
0.9%
neotype 294
 
0.3%
hapantotype 242
 
0.3%
paraneotype 16
 
< 0.1%
2025-01-02T18:29:57.355317image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
P 126956
19.3%
Y 94880
14.4%
E 87292
13.2%
T 87224
13.2%
A 86082
13.1%
O 55743
8.5%
R 41389
 
6.3%
L 32651
 
5.0%
H 25600
 
3.9%
N 10107
 
1.5%
Other values (7) 11226
 
1.7%

Most occurring categories

ValueCountFrequency (%)
(unknown) 659150
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
P 126956
19.3%
Y 94880
14.4%
E 87292
13.2%
T 87224
13.2%
A 86082
13.1%
O 55743
8.5%
R 41389
 
6.3%
L 32651
 
5.0%
H 25600
 
3.9%
N 10107
 
1.5%
Other values (7) 11226
 
1.7%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 659150
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
P 126956
19.3%
Y 94880
14.4%
E 87292
13.2%
T 87224
13.2%
A 86082
13.1%
O 55743
8.5%
R 41389
 
6.3%
L 32651
 
5.0%
H 25600
 
3.9%
N 10107
 
1.5%
Other values (7) 11226
 
1.7%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 659150
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
P 126956
19.3%
Y 94880
14.4%
E 87292
13.2%
T 87224
13.2%
A 86082
13.1%
O 55743
8.5%
R 41389
 
6.3%
L 32651
 
5.0%
H 25600
 
3.9%
N 10107
 
1.5%
Other values (7) 11226
 
1.7%

identifiedBy
Text

Missing 

Distinct13461
Distinct (%)1.6%
Missing1085204
Missing (%)56.3%
Memory size14.7 MiB
2025-01-02T18:29:57.509814image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length226
Median length133
Mean length38.24106825
Min length2

Characters and Unicode

Total characters32167813
Distinct characters94
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4200 ?
Unique (%)0.5%

Sample

1st rowOpresko, Dennis M., Oak Ridge National Laboratory (UNITED STATES)
2nd rowNance
3rd rowMah, Christopher, (IZ), Smithsonian Institution - National Museum of Natural History (UNITED STATES)
4th rowVerrill, Addison E., Peabody Museum, Yale
5th rowJudkins, D.
ValueCountFrequency (%)
of 247193
 
5.3%
museum 200643
 
4.3%
national 197127
 
4.2%
institution 188591
 
4.1%
smithsonian 186061
 
4.0%
natural 185777
 
4.0%
history 185423
 
4.0%
united 130413
 
2.8%
states 129643
 
2.8%
87200
 
1.9%
Other values (9433) 2904278
62.6%
2025-01-02T18:29:57.750259image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3801164
 
11.8%
a 2080528
 
6.5%
i 2056250
 
6.4%
t 2013216
 
6.3%
n 1896071
 
5.9%
o 1744817
 
5.4%
e 1500120
 
4.7%
r 1384928
 
4.3%
s 1382760
 
4.3%
, 1349377
 
4.2%
Other values (84) 12958582
40.3%

Most occurring categories

ValueCountFrequency (%)
(unknown) 32167813
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
3801164
 
11.8%
a 2080528
 
6.5%
i 2056250
 
6.4%
t 2013216
 
6.3%
n 1896071
 
5.9%
o 1744817
 
5.4%
e 1500120
 
4.7%
r 1384928
 
4.3%
s 1382760
 
4.3%
, 1349377
 
4.2%
Other values (84) 12958582
40.3%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 32167813
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
3801164
 
11.8%
a 2080528
 
6.5%
i 2056250
 
6.4%
t 2013216
 
6.3%
n 1896071
 
5.9%
o 1744817
 
5.4%
e 1500120
 
4.7%
r 1384928
 
4.3%
s 1382760
 
4.3%
, 1349377
 
4.2%
Other values (84) 12958582
40.3%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 32167813
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
3801164
 
11.8%
a 2080528
 
6.5%
i 2056250
 
6.4%
t 2013216
 
6.3%
n 1896071
 
5.9%
o 1744817
 
5.4%
e 1500120
 
4.7%
r 1384928
 
4.3%
s 1382760
 
4.3%
, 1349377
 
4.2%
Other values (84) 12958582
40.3%

identifiedByID
Text

Missing 

Distinct2
Distinct (%)100.0%
Missing1926387
Missing (%)> 99.9%
Memory size14.7 MiB
2025-01-02T18:29:57.816360image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length9
Median length8
Mean length8
Min length7

Characters and Unicode

Total characters16
Distinct characters10
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2 ?
Unique (%)100.0%

Sample

1st rowCestoda
2nd rowTrematoda
ValueCountFrequency (%)
cestoda 1
50.0%
trematoda 1
50.0%
2025-01-02T18:29:57.977581image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
a 3
18.8%
e 2
12.5%
d 2
12.5%
t 2
12.5%
o 2
12.5%
C 1
 
6.2%
s 1
 
6.2%
T 1
 
6.2%
r 1
 
6.2%
m 1
 
6.2%

Most occurring categories

ValueCountFrequency (%)
(unknown) 16
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
a 3
18.8%
e 2
12.5%
d 2
12.5%
t 2
12.5%
o 2
12.5%
C 1
 
6.2%
s 1
 
6.2%
T 1
 
6.2%
r 1
 
6.2%
m 1
 
6.2%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 16
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
a 3
18.8%
e 2
12.5%
d 2
12.5%
t 2
12.5%
o 2
12.5%
C 1
 
6.2%
s 1
 
6.2%
T 1
 
6.2%
r 1
 
6.2%
m 1
 
6.2%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 16
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
a 3
18.8%
e 2
12.5%
d 2
12.5%
t 2
12.5%
o 2
12.5%
C 1
 
6.2%
s 1
 
6.2%
T 1
 
6.2%
r 1
 
6.2%
m 1
 
6.2%

dateIdentified
Text

Missing 

Distinct2
Distinct (%)100.0%
Missing1926387
Missing (%)> 99.9%
Memory size14.7 MiB
2025-01-02T18:29:58.037430image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length14
Median length13.5
Mean length13.5
Min length13

Characters and Unicode

Total characters27
Distinct characters14
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2 ?
Unique (%)100.0%

Sample

1st rowTrypanorhyncha
2nd rowPlagiorchiida
ValueCountFrequency (%)
trypanorhyncha 1
50.0%
plagiorchiida 1
50.0%
2025-01-02T18:29:58.156808image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
a 4
14.8%
r 3
11.1%
i 3
11.1%
h 3
11.1%
n 2
7.4%
y 2
7.4%
c 2
7.4%
o 2
7.4%
p 1
 
3.7%
T 1
 
3.7%
Other values (4) 4
14.8%

Most occurring categories

ValueCountFrequency (%)
(unknown) 27
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
a 4
14.8%
r 3
11.1%
i 3
11.1%
h 3
11.1%
n 2
7.4%
y 2
7.4%
c 2
7.4%
o 2
7.4%
p 1
 
3.7%
T 1
 
3.7%
Other values (4) 4
14.8%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 27
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
a 4
14.8%
r 3
11.1%
i 3
11.1%
h 3
11.1%
n 2
7.4%
y 2
7.4%
c 2
7.4%
o 2
7.4%
p 1
 
3.7%
T 1
 
3.7%
Other values (4) 4
14.8%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 27
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
a 4
14.8%
r 3
11.1%
i 3
11.1%
h 3
11.1%
n 2
7.4%
y 2
7.4%
c 2
7.4%
o 2
7.4%
p 1
 
3.7%
T 1
 
3.7%
Other values (4) 4
14.8%

identificationReferences
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB
Distinct2
Distinct (%)100.0%
Missing1926387
Missing (%)> 99.9%
Memory size14.7 MiB
2025-01-02T18:29:58.219719image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length17
Median length15.5
Mean length15.5
Min length14

Characters and Unicode

Total characters31
Distinct characters15
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2 ?
Unique (%)100.0%

Sample

1st rowEutetrarhynchidae
2nd rowDicrocoeliidae
ValueCountFrequency (%)
eutetrarhynchidae 1
50.0%
dicrocoeliidae 1
50.0%
2025-01-02T18:29:58.346751image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
e 4
12.9%
i 4
12.9%
r 3
9.7%
c 3
9.7%
a 3
9.7%
t 2
 
6.5%
h 2
 
6.5%
o 2
 
6.5%
d 2
 
6.5%
E 1
 
3.2%
Other values (5) 5
16.1%

Most occurring categories

ValueCountFrequency (%)
(unknown) 31
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
e 4
12.9%
i 4
12.9%
r 3
9.7%
c 3
9.7%
a 3
9.7%
t 2
 
6.5%
h 2
 
6.5%
o 2
 
6.5%
d 2
 
6.5%
E 1
 
3.2%
Other values (5) 5
16.1%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 31
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
e 4
12.9%
i 4
12.9%
r 3
9.7%
c 3
9.7%
a 3
9.7%
t 2
 
6.5%
h 2
 
6.5%
o 2
 
6.5%
d 2
 
6.5%
E 1
 
3.2%
Other values (5) 5
16.1%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 31
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
e 4
12.9%
i 4
12.9%
r 3
9.7%
c 3
9.7%
a 3
9.7%
t 2
 
6.5%
h 2
 
6.5%
o 2
 
6.5%
d 2
 
6.5%
E 1
 
3.2%
Other values (5) 5
16.1%

identificationRemarks
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

taxonID
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

scientificNameID
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

acceptedNameUsageID
Unsupported

Rejected  Unsupported 

Missing2065
Missing (%)0.1%
Memory size14.7 MiB

parentNameUsageID
Text

Missing 

Distinct2
Distinct (%)100.0%
Missing1926387
Missing (%)> 99.9%
Memory size14.7 MiB
2025-01-02T18:29:58.403843image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length10
Median length10
Mean length10
Min length10

Characters and Unicode

Total characters20
Distinct characters12
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2 ?
Unique (%)100.0%

Sample

1st rowHemionchos
2nd rowConspicuum
ValueCountFrequency (%)
hemionchos 1
50.0%
conspicuum 1
50.0%
2025-01-02T18:29:58.610017image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
o 3
15.0%
i 2
10.0%
n 2
10.0%
m 2
10.0%
u 2
10.0%
s 2
10.0%
c 2
10.0%
e 1
 
5.0%
H 1
 
5.0%
h 1
 
5.0%
Other values (2) 2
10.0%

Most occurring categories

ValueCountFrequency (%)
(unknown) 20
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
o 3
15.0%
i 2
10.0%
n 2
10.0%
m 2
10.0%
u 2
10.0%
s 2
10.0%
c 2
10.0%
e 1
 
5.0%
H 1
 
5.0%
h 1
 
5.0%
Other values (2) 2
10.0%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 20
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
o 3
15.0%
i 2
10.0%
n 2
10.0%
m 2
10.0%
u 2
10.0%
s 2
10.0%
c 2
10.0%
e 1
 
5.0%
H 1
 
5.0%
h 1
 
5.0%
Other values (2) 2
10.0%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 20
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
o 3
15.0%
i 2
10.0%
n 2
10.0%
m 2
10.0%
u 2
10.0%
s 2
10.0%
c 2
10.0%
e 1
 
5.0%
H 1
 
5.0%
h 1
 
5.0%
Other values (2) 2
10.0%

originalNameUsageID
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

nameAccordingToID
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

namePublishedInID
Text

Missing 

Distinct2
Distinct (%)100.0%
Missing1926387
Missing (%)> 99.9%
Memory size14.7 MiB
2025-01-02T18:29:58.669732image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length11
Median length9.5
Mean length9.5
Min length8

Characters and Unicode

Total characters19
Distinct characters11
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2 ?
Unique (%)100.0%

Sample

1st rowstriatus
2nd rowicteridorum
ValueCountFrequency (%)
striatus 1
50.0%
icteridorum 1
50.0%
2025-01-02T18:29:58.793449image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
t 3
15.8%
r 3
15.8%
i 3
15.8%
s 2
10.5%
u 2
10.5%
a 1
 
5.3%
c 1
 
5.3%
e 1
 
5.3%
d 1
 
5.3%
o 1
 
5.3%

Most occurring categories

ValueCountFrequency (%)
(unknown) 19
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
t 3
15.8%
r 3
15.8%
i 3
15.8%
s 2
10.5%
u 2
10.5%
a 1
 
5.3%
c 1
 
5.3%
e 1
 
5.3%
d 1
 
5.3%
o 1
 
5.3%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 19
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
t 3
15.8%
r 3
15.8%
i 3
15.8%
s 2
10.5%
u 2
10.5%
a 1
 
5.3%
c 1
 
5.3%
e 1
 
5.3%
d 1
 
5.3%
o 1
 
5.3%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 19
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
t 3
15.8%
r 3
15.8%
i 3
15.8%
s 2
10.5%
u 2
10.5%
a 1
 
5.3%
c 1
 
5.3%
e 1
 
5.3%
d 1
 
5.3%
o 1
 
5.3%

taxonConceptID
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB
Distinct113079
Distinct (%)5.9%
Missing2
Missing (%)< 0.1%
Memory size14.7 MiB
2025-01-02T18:29:58.959320image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length168
Median length102
Mean length29.16433821
Min length5

Characters and Unicode

Total characters56181802
Distinct characters116
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique38721 ?
Unique (%)2.0%

Sample

1st rowScypha Gray, 1821
2nd rowBulla striata Bruguière, 1792
3rd rowStylopathes columnaris (Duchassaing, 1870)
4th rowOphiothrix suensonii Lütken, 1856
5th rowCypraea labrolineata Gaskoin, 1849
ValueCountFrequency (%)
136410
 
2.0%
linnaeus 96753
 
1.4%
1758 81495
 
1.2%
say 50998
 
0.8%
lamarck 40009
 
0.6%
dall 28184
 
0.4%
conus 24224
 
0.4%
gastropoda 23786
 
0.4%
1791 23649
 
0.3%
gmelin 23215
 
0.3%
Other values (70965) 6239236
92.2%
2025-01-02T18:29:59.205354image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
a 4939373
 
8.8%
4841572
 
8.6%
i 3725884
 
6.6%
e 3410133
 
6.1%
r 2844813
 
5.1%
s 2669041
 
4.8%
o 2472444
 
4.4%
l 2451221
 
4.4%
n 2432205
 
4.3%
t 1939529
 
3.5%
Other values (106) 24455587
43.5%

Most occurring categories

ValueCountFrequency (%)
(unknown) 56181802
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
a 4939373
 
8.8%
4841572
 
8.6%
i 3725884
 
6.6%
e 3410133
 
6.1%
r 2844813
 
5.1%
s 2669041
 
4.8%
o 2472444
 
4.4%
l 2451221
 
4.4%
n 2432205
 
4.3%
t 1939529
 
3.5%
Other values (106) 24455587
43.5%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 56181802
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
a 4939373
 
8.8%
4841572
 
8.6%
i 3725884
 
6.6%
e 3410133
 
6.1%
r 2844813
 
5.1%
s 2669041
 
4.8%
o 2472444
 
4.4%
l 2451221
 
4.4%
n 2432205
 
4.3%
t 1939529
 
3.5%
Other values (106) 24455587
43.5%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 56181802
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
a 4939373
 
8.8%
4841572
 
8.6%
i 3725884
 
6.6%
e 3410133
 
6.1%
r 2844813
 
5.1%
s 2669041
 
4.8%
o 2472444
 
4.4%
l 2451221
 
4.4%
n 2432205
 
4.3%
t 1939529
 
3.5%
Other values (106) 24455587
43.5%

acceptedNameUsage
Text

Constant  Missing 

Distinct1
Distinct (%)50.0%
Missing1926387
Missing (%)> 99.9%
Memory size14.7 MiB
2025-01-02T18:29:59.269525image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length7
Median length7
Mean length7
Min length7

Characters and Unicode

Total characters14
Distinct characters5
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowSPECIES
2nd rowSPECIES
ValueCountFrequency (%)
species 2
100.0%
2025-01-02T18:29:59.375144image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
S 4
28.6%
E 4
28.6%
P 2
14.3%
C 2
14.3%
I 2
14.3%

Most occurring categories

ValueCountFrequency (%)
(unknown) 14
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
S 4
28.6%
E 4
28.6%
P 2
14.3%
C 2
14.3%
I 2
14.3%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 14
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
S 4
28.6%
E 4
28.6%
P 2
14.3%
C 2
14.3%
I 2
14.3%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 14
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
S 4
28.6%
E 4
28.6%
P 2
14.3%
C 2
14.3%
I 2
14.3%

parentNameUsage
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

originalNameUsage
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

nameAccordingTo
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

namePublishedIn
Text

Constant  Missing 

Distinct1
Distinct (%)50.0%
Missing1926387
Missing (%)> 99.9%
Memory size14.7 MiB
2025-01-02T18:29:59.427868image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length8
Median length8
Mean length8
Min length8

Characters and Unicode

Total characters16
Distinct characters6
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowACCEPTED
2nd rowACCEPTED
ValueCountFrequency (%)
accepted 2
100.0%
2025-01-02T18:29:59.532106image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
C 4
25.0%
E 4
25.0%
A 2
12.5%
P 2
12.5%
T 2
12.5%
D 2
12.5%

Most occurring categories

ValueCountFrequency (%)
(unknown) 16
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
C 4
25.0%
E 4
25.0%
A 2
12.5%
P 2
12.5%
T 2
12.5%
D 2
12.5%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 16
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
C 4
25.0%
E 4
25.0%
A 2
12.5%
P 2
12.5%
T 2
12.5%
D 2
12.5%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 16
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
C 4
25.0%
E 4
25.0%
A 2
12.5%
P 2
12.5%
T 2
12.5%
D 2
12.5%

namePublishedInYear
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB
Distinct4354
Distinct (%)0.2%
Missing465
Missing (%)< 0.1%
Memory size14.7 MiB
2025-01-02T18:29:59.648965image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length134
Median length117
Mean length62.96739176
Min length7

Characters and Unicode

Total characters121270411
Distinct characters60
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique586 ?
Unique (%)< 0.1%

Sample

1st rowAnimalia, Porifera, Calcarea
2nd rowAnimalia, Mollusca, Gastropoda, Bullidae
3rd rowAnimalia, Cnidaria, Anthozoa, Hexacorallia, Antipatharia, Stylopathidae
4th rowAnimalia, Echinodermata, Ophiuroidea, Ophiurida, Ophiotrichidae
5th rowAnimalia, Mollusca, Gastropoda, Cypraeidae
ValueCountFrequency (%)
animalia 1922044
 
18.1%
mollusca 866407
 
8.1%
gastropoda 612759
 
5.8%
arthropoda 390750
 
3.7%
crustacea 385110
 
3.6%
malacostraca 301975
 
2.8%
eumalacostraca 294895
 
2.8%
annelida 241801
 
2.3%
polychaeta 212969
 
2.0%
bivalvia 207685
 
2.0%
Other values (4342) 5202802
48.9%
2025-01-02T18:29:59.856866image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
a 19360559
16.0%
i 10629446
 
8.8%
8713273
 
7.2%
, 8691731
 
7.2%
o 7923783
 
6.5%
l 7526240
 
6.2%
e 6162876
 
5.1%
d 5675251
 
4.7%
r 5612652
 
4.6%
c 5023755
 
4.1%
Other values (50) 35950845
29.6%

Most occurring categories

ValueCountFrequency (%)
(unknown) 121270411
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
a 19360559
16.0%
i 10629446
 
8.8%
8713273
 
7.2%
, 8691731
 
7.2%
o 7923783
 
6.5%
l 7526240
 
6.2%
e 6162876
 
5.1%
d 5675251
 
4.7%
r 5612652
 
4.6%
c 5023755
 
4.1%
Other values (50) 35950845
29.6%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 121270411
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
a 19360559
16.0%
i 10629446
 
8.8%
8713273
 
7.2%
, 8691731
 
7.2%
o 7923783
 
6.5%
l 7526240
 
6.2%
e 6162876
 
5.1%
d 5675251
 
4.7%
r 5612652
 
4.6%
c 5023755
 
4.1%
Other values (50) 35950845
29.6%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 121270411
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
a 19360559
16.0%
i 10629446
 
8.8%
8713273
 
7.2%
, 8691731
 
7.2%
o 7923783
 
6.5%
l 7526240
 
6.2%
e 6162876
 
5.1%
d 5675251
 
4.7%
r 5612652
 
4.6%
c 5023755
 
4.1%
Other values (50) 35950845
29.6%
Distinct6
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size14.7 MiB
2025-01-02T18:29:59.929019image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length36
Median length8
Mean length8.007927786
Min length8

Characters and Unicode

Total characters15426384
Distinct characters31
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowAnimalia
2nd rowAnimalia
3rd rowAnimalia
4th rowAnimalia
5th rowAnimalia
ValueCountFrequency (%)
animalia 1920497
99.6%
chromista 2826
 
0.1%
incertae 2065
 
0.1%
sedis 2065
 
0.1%
protozoa 964
 
< 0.1%
bacteria 35
 
< 0.1%
821cc27a-e3bb-4bc5-ac34-89ada245069d 2
 
< 0.1%
2025-01-02T18:30:00.048300image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
i 3847985
24.9%
a 3846927
24.9%
m 1923323
12.5%
n 1922562
12.5%
A 1920497
12.4%
l 1920497
12.4%
s 6956
 
< 0.1%
e 6232
 
< 0.1%
t 5890
 
< 0.1%
r 5890
 
< 0.1%
Other values (21) 19625
 
0.1%

Most occurring categories

ValueCountFrequency (%)
(unknown) 15426384
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
i 3847985
24.9%
a 3846927
24.9%
m 1923323
12.5%
n 1922562
12.5%
A 1920497
12.4%
l 1920497
12.4%
s 6956
 
< 0.1%
e 6232
 
< 0.1%
t 5890
 
< 0.1%
r 5890
 
< 0.1%
Other values (21) 19625
 
0.1%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 15426384
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
i 3847985
24.9%
a 3846927
24.9%
m 1923323
12.5%
n 1922562
12.5%
A 1920497
12.4%
l 1920497
12.4%
s 6956
 
< 0.1%
e 6232
 
< 0.1%
t 5890
 
< 0.1%
r 5890
 
< 0.1%
Other values (21) 19625
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 15426384
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
i 3847985
24.9%
a 3846927
24.9%
m 1923323
12.5%
n 1922562
12.5%
A 1920497
12.4%
l 1920497
12.4%
s 6956
 
< 0.1%
e 6232
 
< 0.1%
t 5890
 
< 0.1%
r 5890
 
< 0.1%
Other values (21) 19625
 
0.1%

phylum
Text

Distinct52
Distinct (%)< 0.1%
Missing3156
Missing (%)0.2%
Memory size14.7 MiB
2025-01-02T18:30:00.123312image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length17
Median length8
Mean length8.850655641
Min length2

Characters and Unicode

Total characters17021873
Distinct characters40
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6 ?
Unique (%)< 0.1%

Sample

1st rowPorifera
2nd rowMollusca
3rd rowCnidaria
4th rowEchinodermata
5th rowMollusca
ValueCountFrequency (%)
mollusca 864192
44.9%
arthropoda 392999
20.4%
annelida 241615
 
12.6%
cnidaria 117703
 
6.1%
echinodermata 91212
 
4.7%
nematoda 68758
 
3.6%
platyhelminthes 45840
 
2.4%
porifera 32733
 
1.7%
chordata 19745
 
1.0%
sipuncula 10415
 
0.5%
Other values (42) 38021
 
2.0%
2025-01-02T18:30:00.275885image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
a 2238532
13.2%
l 2078076
12.2%
o 1907515
11.2%
r 1110893
 
6.5%
c 984617
 
5.8%
d 936895
 
5.5%
s 910659
 
5.3%
u 885746
 
5.2%
M 866329
 
5.1%
n 769092
 
4.5%
Other values (30) 4333519
25.5%

Most occurring categories

ValueCountFrequency (%)
(unknown) 17021873
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
a 2238532
13.2%
l 2078076
12.2%
o 1907515
11.2%
r 1110893
 
6.5%
c 984617
 
5.8%
d 936895
 
5.5%
s 910659
 
5.3%
u 885746
 
5.2%
M 866329
 
5.1%
n 769092
 
4.5%
Other values (30) 4333519
25.5%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 17021873
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
a 2238532
13.2%
l 2078076
12.2%
o 1907515
11.2%
r 1110893
 
6.5%
c 984617
 
5.8%
d 936895
 
5.5%
s 910659
 
5.3%
u 885746
 
5.2%
M 866329
 
5.1%
n 769092
 
4.5%
Other values (30) 4333519
25.5%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 17021873
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
a 2238532
13.2%
l 2078076
12.2%
o 1907515
11.2%
r 1110893
 
6.5%
c 984617
 
5.8%
d 936895
 
5.5%
s 910659
 
5.3%
u 885746
 
5.2%
M 866329
 
5.1%
n 769092
 
4.5%
Other values (30) 4333519
25.5%

class
Text

Missing 

Distinct116
Distinct (%)< 0.1%
Missing66153
Missing (%)3.4%
Memory size14.7 MiB
2025-01-02T18:30:00.361116image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length24
Median length19
Mean length10.05340075
Min length4

Characters and Unicode

Total characters18701698
Distinct characters54
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique11 ?
Unique (%)< 0.1%

Sample

1st rowCalcarea
2nd rowGastropoda
3rd rowAnthozoa
4th rowOphiuroidea
5th rowGastropoda
ValueCountFrequency (%)
gastropoda 610123
32.8%
malacostraca 301912
16.2%
polychaeta 211086
 
11.3%
bivalvia 207854
 
11.2%
anthozoa 93050
 
5.0%
copepoda 46190
 
2.5%
chromadorea 42750
 
2.3%
clitellata 30336
 
1.6%
ophiuroidea 27087
 
1.5%
asteroidea 25635
 
1.4%
Other values (106) 264213
14.2%
2025-01-02T18:30:00.517864image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
a 4042336
21.6%
o 2534615
13.6%
t 1401870
 
7.5%
r 1169735
 
6.3%
s 1022238
 
5.5%
d 956343
 
5.1%
c 944030
 
5.0%
l 924665
 
4.9%
p 848962
 
4.5%
i 703184
 
3.8%
Other values (44) 4153720
22.2%

Most occurring categories

ValueCountFrequency (%)
(unknown) 18701698
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
a 4042336
21.6%
o 2534615
13.6%
t 1401870
 
7.5%
r 1169735
 
6.3%
s 1022238
 
5.5%
d 956343
 
5.1%
c 944030
 
5.0%
l 924665
 
4.9%
p 848962
 
4.5%
i 703184
 
3.8%
Other values (44) 4153720
22.2%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 18701698
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
a 4042336
21.6%
o 2534615
13.6%
t 1401870
 
7.5%
r 1169735
 
6.3%
s 1022238
 
5.5%
d 956343
 
5.1%
c 944030
 
5.0%
l 924665
 
4.9%
p 848962
 
4.5%
i 703184
 
3.8%
Other values (44) 4153720
22.2%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 18701698
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
a 4042336
21.6%
o 2534615
13.6%
t 1401870
 
7.5%
r 1169735
 
6.3%
s 1022238
 
5.5%
d 956343
 
5.1%
c 944030
 
5.0%
l 924665
 
4.9%
p 848962
 
4.5%
i 703184
 
3.8%
Other values (44) 4153720
22.2%

order
Text

Missing 

Distinct414
Distinct (%)< 0.1%
Missing329533
Missing (%)17.1%
Memory size14.7 MiB
2025-01-02T18:30:00.624487image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length22
Median length20
Mean length11.19175304
Min length5

Characters and Unicode

Total characters17871618
Distinct characters46
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique24 ?
Unique (%)< 0.1%

Sample

1st rowLeucosolenida
2nd rowCephalaspidea
3rd rowAntipatharia
4th rowAmphilepidida
5th rowLittorinimorpha
ValueCountFrequency (%)
decapoda 196384
 
12.3%
neogastropoda 156428
 
9.8%
stylommatophora 116401
 
7.3%
littorinimorpha 113553
 
7.1%
phyllodocida 69439
 
4.3%
scleractinia 54200
 
3.4%
amphipoda 49533
 
3.1%
rhabditida 35176
 
2.2%
venerida 31275
 
2.0%
cardiida 30439
 
1.9%
Other values (404) 744028
46.6%
2025-01-02T18:30:00.807773image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
a 2716551
15.2%
o 2130021
11.9%
i 1739041
 
9.7%
d 1413506
 
7.9%
t 1052242
 
5.9%
p 961716
 
5.4%
r 907952
 
5.1%
e 872746
 
4.9%
c 825635
 
4.6%
l 796133
 
4.5%
Other values (36) 4456075
24.9%

Most occurring categories

ValueCountFrequency (%)
(unknown) 17871618
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
a 2716551
15.2%
o 2130021
11.9%
i 1739041
 
9.7%
d 1413506
 
7.9%
t 1052242
 
5.9%
p 961716
 
5.4%
r 907952
 
5.1%
e 872746
 
4.9%
c 825635
 
4.6%
l 796133
 
4.5%
Other values (36) 4456075
24.9%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 17871618
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
a 2716551
15.2%
o 2130021
11.9%
i 1739041
 
9.7%
d 1413506
 
7.9%
t 1052242
 
5.9%
p 961716
 
5.4%
r 907952
 
5.1%
e 872746
 
4.9%
c 825635
 
4.6%
l 796133
 
4.5%
Other values (36) 4456075
24.9%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 17871618
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
a 2716551
15.2%
o 2130021
11.9%
i 1739041
 
9.7%
d 1413506
 
7.9%
t 1052242
 
5.9%
p 961716
 
5.4%
r 907952
 
5.1%
e 872746
 
4.9%
c 825635
 
4.6%
l 796133
 
4.5%
Other values (36) 4456075
24.9%

superfamily
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

family
Text

Missing 

Distinct3522
Distinct (%)0.2%
Missing144484
Missing (%)7.5%
Memory size14.7 MiB
2025-01-02T18:30:00.921247image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length23
Median length21
Mean length11.20729837
Min length6

Characters and Unicode

Total characters19970341
Distinct characters52
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique272 ?
Unique (%)< 0.1%

Sample

1st rowSyconidae
2nd rowBullidae
3rd rowStylopathidae
4th rowOphiotrichidae
5th rowCypraeidae
ValueCountFrequency (%)
cambaridae 28956
 
1.6%
conidae 28425
 
1.6%
unionidae 26787
 
1.5%
muricidae 22783
 
1.3%
veneridae 18640
 
1.0%
cypraeidae 16831
 
0.9%
cerithiidae 16777
 
0.9%
spionidae 15856
 
0.9%
syllidae 14115
 
0.8%
pectinidae 12961
 
0.7%
Other values (3512) 1579774
88.7%
2025-01-02T18:30:01.108267image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
i 2971768
14.9%
a 2739603
13.7%
e 2656666
13.3%
d 2019505
10.1%
o 1034346
 
5.2%
l 1016313
 
5.1%
r 1015168
 
5.1%
n 842574
 
4.2%
t 674978
 
3.4%
c 545975
 
2.7%
Other values (42) 4453445
22.3%

Most occurring categories

ValueCountFrequency (%)
(unknown) 19970341
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
i 2971768
14.9%
a 2739603
13.7%
e 2656666
13.3%
d 2019505
10.1%
o 1034346
 
5.2%
l 1016313
 
5.1%
r 1015168
 
5.1%
n 842574
 
4.2%
t 674978
 
3.4%
c 545975
 
2.7%
Other values (42) 4453445
22.3%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 19970341
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
i 2971768
14.9%
a 2739603
13.7%
e 2656666
13.3%
d 2019505
10.1%
o 1034346
 
5.2%
l 1016313
 
5.1%
r 1015168
 
5.1%
n 842574
 
4.2%
t 674978
 
3.4%
c 545975
 
2.7%
Other values (42) 4453445
22.3%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 19970341
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
i 2971768
14.9%
a 2739603
13.7%
e 2656666
13.3%
d 2019505
10.1%
o 1034346
 
5.2%
l 1016313
 
5.1%
r 1015168
 
5.1%
n 842574
 
4.2%
t 674978
 
3.4%
c 545975
 
2.7%
Other values (42) 4453445
22.3%

subfamily
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

tribe
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

subtribe
Text

Missing 

Distinct2
Distinct (%)100.0%
Missing1926387
Missing (%)> 99.9%
Memory size14.7 MiB
2025-01-02T18:30:01.227484image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length130
Median length89
Mean length89
Min length48

Characters and Unicode

Total characters178
Distinct characters21
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2 ?
Unique (%)100.0%

Sample

1st rowOCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT;GEODETIC_DATUM_ASSUMED_WGS84;CONTINENT_DERIVED_FROM_COORDINATES;CONTINENT_INVALID
2nd rowOCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT
ValueCountFrequency (%)
occurrence_status_inferred_from_individual_count;geodetic_datum_assumed_wgs84;continent_derived_from_coordinates;continent_invalid 1
50.0%
occurrence_status_inferred_from_individual_count 1
50.0%
2025-01-02T18:30:01.391439image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
_ 17
9.6%
N 16
 
9.0%
E 16
 
9.0%
I 15
 
8.4%
R 13
 
7.3%
D 13
 
7.3%
T 13
 
7.3%
O 12
 
6.7%
C 12
 
6.7%
U 10
 
5.6%
Other values (11) 41
23.0%

Most occurring categories

ValueCountFrequency (%)
(unknown) 178
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
_ 17
9.6%
N 16
 
9.0%
E 16
 
9.0%
I 15
 
8.4%
R 13
 
7.3%
D 13
 
7.3%
T 13
 
7.3%
O 12
 
6.7%
C 12
 
6.7%
U 10
 
5.6%
Other values (11) 41
23.0%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 178
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
_ 17
9.6%
N 16
 
9.0%
E 16
 
9.0%
I 15
 
8.4%
R 13
 
7.3%
D 13
 
7.3%
T 13
 
7.3%
O 12
 
6.7%
C 12
 
6.7%
U 10
 
5.6%
Other values (11) 41
23.0%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 178
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
_ 17
9.6%
N 16
 
9.0%
E 16
 
9.0%
I 15
 
8.4%
R 13
 
7.3%
D 13
 
7.3%
T 13
 
7.3%
O 12
 
6.7%
C 12
 
6.7%
U 10
 
5.6%
Other values (11) 41
23.0%

genus
Text

Missing 

Distinct20787
Distinct (%)1.3%
Missing358040
Missing (%)18.6%
Memory size14.7 MiB
2025-01-02T18:30:01.517132image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length27
Median length23
Mean length9.482777111
Min length2

Characters and Unicode

Total characters14872304
Distinct characters52
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3152 ?
Unique (%)0.2%

Sample

1st rowSycon
2nd rowBulla
3rd rowStylopathes
4th rowOphiothrix
5th rowNaria
ValueCountFrequency (%)
conus 22884
 
1.5%
cerithium 8956
 
0.6%
cambarus 8948
 
0.6%
faxonius 8189
 
0.5%
procambarus 8096
 
0.5%
aricidea 5223
 
0.3%
nerita 4536
 
0.3%
nassarius 4534
 
0.3%
pagurus 4234
 
0.3%
elimia 4085
 
0.3%
Other values (20777) 1488664
94.9%
2025-01-02T18:30:01.699408image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
a 1794417
 
12.1%
i 1296042
 
8.7%
o 1190619
 
8.0%
e 1030226
 
6.9%
r 967232
 
6.5%
l 958555
 
6.4%
s 949415
 
6.4%
n 726098
 
4.9%
t 714218
 
4.8%
u 705352
 
4.7%
Other values (42) 4540130
30.5%

Most occurring categories

ValueCountFrequency (%)
(unknown) 14872304
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
a 1794417
 
12.1%
i 1296042
 
8.7%
o 1190619
 
8.0%
e 1030226
 
6.9%
r 967232
 
6.5%
l 958555
 
6.4%
s 949415
 
6.4%
n 726098
 
4.9%
t 714218
 
4.8%
u 705352
 
4.7%
Other values (42) 4540130
30.5%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 14872304
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
a 1794417
 
12.1%
i 1296042
 
8.7%
o 1190619
 
8.0%
e 1030226
 
6.9%
r 967232
 
6.5%
l 958555
 
6.4%
s 949415
 
6.4%
n 726098
 
4.9%
t 714218
 
4.8%
u 705352
 
4.7%
Other values (42) 4540130
30.5%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 14872304
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
a 1794417
 
12.1%
i 1296042
 
8.7%
o 1190619
 
8.0%
e 1030226
 
6.9%
r 967232
 
6.5%
l 958555
 
6.4%
s 949415
 
6.4%
n 726098
 
4.9%
t 714218
 
4.8%
u 705352
 
4.7%
Other values (42) 4540130
30.5%

genericName
Text

Missing 

Distinct21084
Distinct (%)1.3%
Missing358039
Missing (%)18.6%
Memory size14.7 MiB
2025-01-02T18:30:01.825624image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length27
Median length23
Mean length9.309154844
Min length1

Characters and Unicode

Total characters14600013
Distinct characters54
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3830 ?
Unique (%)0.2%

Sample

1st rowScypha
2nd rowBulla
3rd rowStylopathes
4th rowOphiothrix
5th rowCypraea
ValueCountFrequency (%)
conus 24156
 
1.5%
cypraea 15390
 
1.0%
cambarus 10146
 
0.6%
cerithium 9393
 
0.6%
orconectes 8661
 
0.6%
procambarus 8047
 
0.5%
nassarius 6727
 
0.4%
lumbrineris 4967
 
0.3%
terebra 4662
 
0.3%
aricidea 4572
 
0.3%
Other values (21074) 1471629
93.8%
2025-01-02T18:30:02.020802image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
a 1744079
 
11.9%
i 1263792
 
8.7%
o 1156021
 
7.9%
e 1016938
 
7.0%
r 967987
 
6.6%
s 938349
 
6.4%
l 915577
 
6.3%
t 706525
 
4.8%
n 704068
 
4.8%
u 686498
 
4.7%
Other values (44) 4500179
30.8%

Most occurring categories

ValueCountFrequency (%)
(unknown) 14600013
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
a 1744079
 
11.9%
i 1263792
 
8.7%
o 1156021
 
7.9%
e 1016938
 
7.0%
r 967987
 
6.6%
s 938349
 
6.4%
l 915577
 
6.3%
t 706525
 
4.8%
n 704068
 
4.8%
u 686498
 
4.7%
Other values (44) 4500179
30.8%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 14600013
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
a 1744079
 
11.9%
i 1263792
 
8.7%
o 1156021
 
7.9%
e 1016938
 
7.0%
r 967987
 
6.6%
s 938349
 
6.4%
l 915577
 
6.3%
t 706525
 
4.8%
n 704068
 
4.8%
u 686498
 
4.7%
Other values (44) 4500179
30.8%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 14600013
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
a 1744079
 
11.9%
i 1263792
 
8.7%
o 1156021
 
7.9%
e 1016938
 
7.0%
r 967987
 
6.6%
s 938349
 
6.4%
l 915577
 
6.3%
t 706525
 
4.8%
n 704068
 
4.8%
u 686498
 
4.7%
Other values (44) 4500179
30.8%

subgenus
Boolean

Constant  Missing 

Distinct1
Distinct (%)50.0%
Missing1926387
Missing (%)> 99.9%
Memory size14.7 MiB
False
 
2
(Missing)
1926387 
ValueCountFrequency (%)
False 2
 
< 0.1%
(Missing) 1926387
> 99.9%
2025-01-02T18:30:02.081287image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

infragenericEpithet
Real number (ℝ)

Missing 

Distinct2
Distinct (%)100.0%
Missing1926387
Missing (%)> 99.9%
Infinite0
Infinite (%)0.0%
Mean4493591.5
Minimum2504455
Maximum6482728
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size14.7 MiB
2025-01-02T18:30:02.123676image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Quantile statistics

Minimum2504455
5-th percentile2703368.65
Q13499023.25
median4493591.5
Q35488159.75
95-th percentile6283814.35
Maximum6482728
Range3978273
Interquartile range (IQR)1989136.5

Descriptive statistics

Standard deviation2813063.816
Coefficient of variation (CV)0.6260168099
Kurtosisnan
Mean4493591.5
Median Absolute Deviation (MAD)1989136.5
Skewnessnan
Sum8987183
Variance7.913328031 × 1012
MonotonicityStrictly decreasing
2025-01-02T18:30:02.174928image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
Histogram with fixed size bins (bins=2)
ValueCountFrequency (%)
6482728 1
 
< 0.1%
2504455 1
 
< 0.1%
(Missing) 1926387
> 99.9%
ValueCountFrequency (%)
2504455 1
< 0.1%
6482728 1
< 0.1%
ValueCountFrequency (%)
6482728 1
< 0.1%
2504455 1
< 0.1%

specificEpithet
Text

Missing 

Distinct39412
Distinct (%)3.0%
Missing626794
Missing (%)32.5%
Memory size14.7 MiB
2025-01-02T18:30:02.318311image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length22
Median length19
Mean length8.507768189
Min length2

Characters and Unicode

Total characters11056653
Distinct characters38
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9920 ?
Unique (%)0.8%

Sample

1st rowstriata
2nd rowcolumnaris
3rd rowsuensonii
4th rowlabrolineata
5th rowheteractis
ValueCountFrequency (%)
gracilis 6098
 
0.5%
fragilis 3477
 
0.3%
affinis 3341
 
0.3%
elegans 3182
 
0.2%
aculeata 3066
 
0.2%
borealis 2967
 
0.2%
americanus 2637
 
0.2%
grandis 2519
 
0.2%
acutus 2312
 
0.2%
tenuis 2265
 
0.2%
Other values (39402) 1267731
97.5%
2025-01-02T18:30:02.543925image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
a 1553197
14.0%
i 1250540
11.3%
s 956883
 
8.7%
e 779958
 
7.1%
r 771552
 
7.0%
t 706671
 
6.4%
u 704699
 
6.4%
n 690520
 
6.2%
l 660182
 
6.0%
c 552656
 
5.0%
Other values (28) 2429795
22.0%

Most occurring categories

ValueCountFrequency (%)
(unknown) 11056653
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
a 1553197
14.0%
i 1250540
11.3%
s 956883
 
8.7%
e 779958
 
7.1%
r 771552
 
7.0%
t 706671
 
6.4%
u 704699
 
6.4%
n 690520
 
6.2%
l 660182
 
6.0%
c 552656
 
5.0%
Other values (28) 2429795
22.0%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 11056653
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
a 1553197
14.0%
i 1250540
11.3%
s 956883
 
8.7%
e 779958
 
7.1%
r 771552
 
7.0%
t 706671
 
6.4%
u 704699
 
6.4%
n 690520
 
6.2%
l 660182
 
6.0%
c 552656
 
5.0%
Other values (28) 2429795
22.0%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 11056653
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
a 1553197
14.0%
i 1250540
11.3%
s 956883
 
8.7%
e 779958
 
7.1%
r 771552
 
7.0%
t 706671
 
6.4%
u 704699
 
6.4%
n 690520
 
6.2%
l 660182
 
6.0%
c 552656
 
5.0%
Other values (28) 2429795
22.0%

infraspecificEpithet
Text

Missing 

Distinct3653
Distinct (%)10.1%
Missing1890285
Missing (%)98.1%
Memory size14.7 MiB
2025-01-02T18:30:02.689622image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length18
Median length16
Mean length8.605777753
Min length1

Characters and Unicode

Total characters310703
Distinct characters29
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1259 ?
Unique (%)3.5%

Sample

1st rowconnectens
2nd rowlaevis
3rd rowschizodontia
4th rowantarctica
5th rowsayi
ValueCountFrequency (%)
acutus 1011
 
2.8%
radiata 616
 
1.7%
bartonii 521
 
1.4%
gibbosus 501
 
1.4%
appressa 443
 
1.2%
campanulatum 379
 
1.0%
longimanus 359
 
1.0%
carinata 350
 
1.0%
floridana 283
 
0.8%
trivolvis 273
 
0.8%
Other values (3643) 31368
86.9%
2025-01-02T18:30:02.904074image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
a 45988
14.8%
i 33598
10.8%
s 29641
9.5%
e 22986
 
7.4%
n 22086
 
7.1%
u 19813
 
6.4%
r 19186
 
6.2%
t 17670
 
5.7%
l 16838
 
5.4%
c 16647
 
5.4%
Other values (19) 66250
21.3%

Most occurring categories

ValueCountFrequency (%)
(unknown) 310703
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
a 45988
14.8%
i 33598
10.8%
s 29641
9.5%
e 22986
 
7.4%
n 22086
 
7.1%
u 19813
 
6.4%
r 19186
 
6.2%
t 17670
 
5.7%
l 16838
 
5.4%
c 16647
 
5.4%
Other values (19) 66250
21.3%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 310703
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
a 45988
14.8%
i 33598
10.8%
s 29641
9.5%
e 22986
 
7.4%
n 22086
 
7.1%
u 19813
 
6.4%
r 19186
 
6.2%
t 17670
 
5.7%
l 16838
 
5.4%
c 16647
 
5.4%
Other values (19) 66250
21.3%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 310703
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
a 45988
14.8%
i 33598
10.8%
s 29641
9.5%
e 22986
 
7.4%
n 22086
 
7.1%
u 19813
 
6.4%
r 19186
 
6.2%
t 17670
 
5.7%
l 16838
 
5.4%
c 16647
 
5.4%
Other values (19) 66250
21.3%

cultivarEpithet
Real number (ℝ)

Constant  Missing 

Distinct1
Distinct (%)50.0%
Missing1926387
Missing (%)> 99.9%
Infinite0
Infinite (%)0.0%
Mean108
Minimum108
Maximum108
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size14.7 MiB
2025-01-02T18:30:02.965538image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Quantile statistics

Minimum108
5-th percentile108
Q1108
median108
Q3108
95-th percentile108
Maximum108
Range0
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0
Coefficient of variation (CV)0
Kurtosisnan
Mean108
Median Absolute Deviation (MAD)0
Skewnessnan
Sum216
Variance0
MonotonicityIncreasing
2025-01-02T18:30:03.011955image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
Histogram with fixed size bins (bins=1)
ValueCountFrequency (%)
108 2
 
< 0.1%
(Missing) 1926387
> 99.9%
ValueCountFrequency (%)
108 2
< 0.1%
ValueCountFrequency (%)
108 2
< 0.1%
Distinct13
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size14.7 MiB
2025-01-02T18:30:03.164237image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length10
Median length7
Mean length6.539587799
Min length3

Characters and Unicode

Total characters12597790
Distinct characters25
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3 ?
Unique (%)< 0.1%

Sample

1st rowGENUS
2nd rowSPECIES
3rd rowSPECIES
4th rowSPECIES
5th rowSPECIES
ValueCountFrequency (%)
species 1263491
65.6%
genus 268755
 
14.0%
family 216656
 
11.2%
class 63569
 
3.3%
phylum 48164
 
2.5%
subspecies 32829
 
1.7%
order 26813
 
1.4%
kingdom 2836
 
0.1%
variety 2500
 
0.1%
form 773
 
< 0.1%
Other values (3) 3
 
< 0.1%
2025-01-02T18:30:03.289677image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
S 3021362
24.0%
E 2890709
22.9%
I 1518312
12.1%
C 1359889
10.8%
P 1344484
10.7%
U 349749
 
2.8%
L 328389
 
2.6%
A 282726
 
2.2%
N 271593
 
2.2%
G 271591
 
2.2%
Other values (15) 958986
 
7.6%

Most occurring categories

ValueCountFrequency (%)
(unknown) 12597790
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
S 3021362
24.0%
E 2890709
22.9%
I 1518312
12.1%
C 1359889
10.8%
P 1344484
10.7%
U 349749
 
2.8%
L 328389
 
2.6%
A 282726
 
2.2%
N 271593
 
2.2%
G 271591
 
2.2%
Other values (15) 958986
 
7.6%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 12597790
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
S 3021362
24.0%
E 2890709
22.9%
I 1518312
12.1%
C 1359889
10.8%
P 1344484
10.7%
U 349749
 
2.8%
L 328389
 
2.6%
A 282726
 
2.2%
N 271593
 
2.2%
G 271591
 
2.2%
Other values (15) 958986
 
7.6%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 12597790
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
S 3021362
24.0%
E 2890709
22.9%
I 1518312
12.1%
C 1359889
10.8%
P 1344484
10.7%
U 349749
 
2.8%
L 328389
 
2.6%
A 282726
 
2.2%
N 271593
 
2.2%
G 271591
 
2.2%
Other values (15) 958986
 
7.6%

verbatimTaxonRank
Real number (ℝ)

Missing 

Distinct2
Distinct (%)100.0%
Missing1926387
Missing (%)> 99.9%
Infinite0
Infinite (%)0.0%
Mean662.5
Minimum434
Maximum891
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size14.7 MiB
2025-01-02T18:30:03.339464image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Quantile statistics

Minimum434
5-th percentile456.85
Q1548.25
median662.5
Q3776.75
95-th percentile868.15
Maximum891
Range457
Interquartile range (IQR)228.5

Descriptive statistics

Standard deviation323.147799
Coefficient of variation (CV)0.4877702626
Kurtosisnan
Mean662.5
Median Absolute Deviation (MAD)228.5
Skewnessnan
Sum1325
Variance104424.5
MonotonicityStrictly decreasing
2025-01-02T18:30:03.384723image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
Histogram with fixed size bins (bins=2)
ValueCountFrequency (%)
891 1
 
< 0.1%
434 1
 
< 0.1%
(Missing) 1926387
> 99.9%
ValueCountFrequency (%)
434 1
< 0.1%
891 1
< 0.1%
ValueCountFrequency (%)
891 1
< 0.1%
434 1
< 0.1%

vernacularName
Real number (ℝ)

Missing 

Distinct2
Distinct (%)100.0%
Missing1926387
Missing (%)> 99.9%
Infinite0
Infinite (%)0.0%
Mean6190
Minimum5954
Maximum6426
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size14.7 MiB
2025-01-02T18:30:03.430795image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Quantile statistics

Minimum5954
5-th percentile5977.6
Q16072
median6190
Q36308
95-th percentile6402.4
Maximum6426
Range472
Interquartile range (IQR)236

Descriptive statistics

Standard deviation333.7544007
Coefficient of variation (CV)0.05391831999
Kurtosisnan
Mean6190
Median Absolute Deviation (MAD)236
Skewnessnan
Sum12380
Variance111392
MonotonicityStrictly increasing
2025-01-02T18:30:03.474947image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
Histogram with fixed size bins (bins=2)
ValueCountFrequency (%)
5954 1
 
< 0.1%
6426 1
 
< 0.1%
(Missing) 1926387
> 99.9%
ValueCountFrequency (%)
5954 1
< 0.1%
6426 1
< 0.1%
ValueCountFrequency (%)
6426 1
< 0.1%
5954 1
< 0.1%

nomenclaturalCode
Real number (ℝ)

Missing 

Distinct2
Distinct (%)100.0%
Missing1926387
Missing (%)> 99.9%
Infinite0
Infinite (%)0.0%
Mean4493589.5
Minimum2504454
Maximum6482725
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size14.7 MiB
2025-01-02T18:30:03.520458image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Quantile statistics

Minimum2504454
5-th percentile2703367.55
Q13499021.75
median4493589.5
Q35488157.25
95-th percentile6283811.45
Maximum6482725
Range3978271
Interquartile range (IQR)1989135.5

Descriptive statistics

Standard deviation2813062.401
Coefficient of variation (CV)0.6260167738
Kurtosisnan
Mean4493589.5
Median Absolute Deviation (MAD)1989135.5
Skewnessnan
Sum8987179
Variance7.913320075 × 1012
MonotonicityStrictly decreasing
2025-01-02T18:30:03.572278image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
Histogram with fixed size bins (bins=2)
ValueCountFrequency (%)
6482725 1
 
< 0.1%
2504454 1
 
< 0.1%
(Missing) 1926387
> 99.9%
ValueCountFrequency (%)
2504454 1
< 0.1%
6482725 1
< 0.1%
ValueCountFrequency (%)
6482725 1
< 0.1%
2504454 1
< 0.1%
Distinct3
Distinct (%)< 0.1%
Missing2067
Missing (%)0.1%
Memory size14.7 MiB
2025-01-02T18:30:03.633352image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length8
Median length8
Mean length7.818195707
Min length7

Characters and Unicode

Total characters15044726
Distinct characters15
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowSYNONYM
2nd rowACCEPTED
3rd rowACCEPTED
4th rowACCEPTED
5th rowSYNONYM
ValueCountFrequency (%)
accepted 1560511
81.1%
synonym 349850
 
18.2%
doubtful 13961
 
0.7%
2025-01-02T18:30:03.750246image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
C 3121022
20.7%
E 3121022
20.7%
T 1574472
10.5%
D 1574472
10.5%
A 1560511
10.4%
P 1560511
10.4%
Y 699700
 
4.7%
N 699700
 
4.7%
O 363811
 
2.4%
S 349850
 
2.3%
Other values (5) 419655
 
2.8%

Most occurring categories

ValueCountFrequency (%)
(unknown) 15044726
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
C 3121022
20.7%
E 3121022
20.7%
T 1574472
10.5%
D 1574472
10.5%
A 1560511
10.4%
P 1560511
10.4%
Y 699700
 
4.7%
N 699700
 
4.7%
O 363811
 
2.4%
S 349850
 
2.3%
Other values (5) 419655
 
2.8%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 15044726
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
C 3121022
20.7%
E 3121022
20.7%
T 1574472
10.5%
D 1574472
10.5%
A 1560511
10.4%
P 1560511
10.4%
Y 699700
 
4.7%
N 699700
 
4.7%
O 363811
 
2.4%
S 349850
 
2.3%
Other values (5) 419655
 
2.8%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 15044726
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
C 3121022
20.7%
E 3121022
20.7%
T 1574472
10.5%
D 1574472
10.5%
A 1560511
10.4%
P 1560511
10.4%
Y 699700
 
4.7%
N 699700
 
4.7%
O 363811
 
2.4%
S 349850
 
2.3%
Other values (5) 419655
 
2.8%

nomenclaturalStatus
Real number (ℝ)

Missing 

Distinct2
Distinct (%)100.0%
Missing1926387
Missing (%)> 99.9%
Infinite0
Infinite (%)0.0%
Mean4493591.5
Minimum2504455
Maximum6482728
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size14.7 MiB
2025-01-02T18:30:03.799888image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Quantile statistics

Minimum2504455
5-th percentile2703368.65
Q13499023.25
median4493591.5
Q35488159.75
95-th percentile6283814.35
Maximum6482728
Range3978273
Interquartile range (IQR)1989136.5

Descriptive statistics

Standard deviation2813063.816
Coefficient of variation (CV)0.6260168099
Kurtosisnan
Mean4493591.5
Median Absolute Deviation (MAD)1989136.5
Skewnessnan
Sum8987183
Variance7.913328031 × 1012
MonotonicityStrictly decreasing
2025-01-02T18:30:03.853418image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
Histogram with fixed size bins (bins=2)
ValueCountFrequency (%)
6482728 1
 
< 0.1%
2504455 1
 
< 0.1%
(Missing) 1926387
> 99.9%
ValueCountFrequency (%)
2504455 1
< 0.1%
6482728 1
< 0.1%
ValueCountFrequency (%)
6482728 1
< 0.1%
2504455 1
< 0.1%

taxonRemarks
Text

Missing 

Distinct2
Distinct (%)100.0%
Missing1926387
Missing (%)> 99.9%
Memory size14.7 MiB
2025-01-02T18:30:03.918813image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length22
Median length20.5
Mean length20.5
Min length19

Characters and Unicode

Total characters41
Distinct characters17
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2 ?
Unique (%)100.0%

Sample

1st rowHemionchos striatus
2nd rowConspicuum icteridorum
ValueCountFrequency (%)
hemionchos 1
25.0%
striatus 1
25.0%
conspicuum 1
25.0%
icteridorum 1
25.0%
2025-01-02T18:30:04.152198image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
i 5
12.2%
s 4
9.8%
o 4
9.8%
u 4
9.8%
c 3
 
7.3%
t 3
 
7.3%
r 3
 
7.3%
m 3
 
7.3%
e 2
 
4.9%
2
 
4.9%
Other values (7) 8
19.5%

Most occurring categories

ValueCountFrequency (%)
(unknown) 41
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
i 5
12.2%
s 4
9.8%
o 4
9.8%
u 4
9.8%
c 3
 
7.3%
t 3
 
7.3%
r 3
 
7.3%
m 3
 
7.3%
e 2
 
4.9%
2
 
4.9%
Other values (7) 8
19.5%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 41
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
i 5
12.2%
s 4
9.8%
o 4
9.8%
u 4
9.8%
c 3
 
7.3%
t 3
 
7.3%
r 3
 
7.3%
m 3
 
7.3%
e 2
 
4.9%
2
 
4.9%
Other values (7) 8
19.5%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 41
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
i 5
12.2%
s 4
9.8%
o 4
9.8%
u 4
9.8%
c 3
 
7.3%
t 3
 
7.3%
r 3
 
7.3%
m 3
 
7.3%
e 2
 
4.9%
2
 
4.9%
Other values (7) 8
19.5%
Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size14.7 MiB
2025-01-02T18:30:04.234302image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length46
Median length36
Mean length36.00000831
Min length36

Characters and Unicode

Total characters69350020
Distinct characters37
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2 ?
Unique (%)< 0.1%

Sample

1st row821cc27a-e3bb-4bc5-ac34-89ada245069d
2nd row821cc27a-e3bb-4bc5-ac34-89ada245069d
3rd row821cc27a-e3bb-4bc5-ac34-89ada245069d
4th row821cc27a-e3bb-4bc5-ac34-89ada245069d
5th row821cc27a-e3bb-4bc5-ac34-89ada245069d
ValueCountFrequency (%)
821cc27a-e3bb-4bc5-ac34-89ada245069d 1926387
> 99.9%
2
 
< 0.1%
striatus 1
 
< 0.1%
hemionchos 1
 
< 0.1%
campbell 1
 
< 0.1%
beveridge 1
 
< 0.1%
2006 1
 
< 0.1%
conspicuum 1
 
< 0.1%
icteridorum 1
 
< 0.1%
denton 1
 
< 0.1%
Other values (2) 2
 
< 0.1%
2025-01-02T18:30:04.367526image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
c 7705551
11.1%
a 7705550
11.1%
- 7705548
11.1%
2 5779162
8.3%
b 5779162
8.3%
4 5779161
8.3%
d 3852777
 
5.6%
9 3852775
 
5.6%
5 3852775
 
5.6%
8 3852774
 
5.6%
Other values (27) 13484785
19.4%

Most occurring categories

ValueCountFrequency (%)
(unknown) 69350020
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
c 7705551
11.1%
a 7705550
11.1%
- 7705548
11.1%
2 5779162
8.3%
b 5779162
8.3%
4 5779161
8.3%
d 3852777
 
5.6%
9 3852775
 
5.6%
5 3852775
 
5.6%
8 3852774
 
5.6%
Other values (27) 13484785
19.4%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 69350020
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
c 7705551
11.1%
a 7705550
11.1%
- 7705548
11.1%
2 5779162
8.3%
b 5779162
8.3%
4 5779161
8.3%
d 3852777
 
5.6%
9 3852775
 
5.6%
5 3852775
 
5.6%
8 3852774
 
5.6%
Other values (27) 13484785
19.4%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 69350020
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
c 7705551
11.1%
a 7705550
11.1%
- 7705548
11.1%
2 5779162
8.3%
b 5779162
8.3%
4 5779161
8.3%
d 3852777
 
5.6%
9 3852775
 
5.6%
5 3852775
 
5.6%
8 3852774
 
5.6%
Other values (27) 13484785
19.4%
Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size14.7 MiB
2025-01-02T18:30:04.415501image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length22
Median length2
Mean length2.000019207
Min length2

Characters and Unicode

Total characters3852815
Distinct characters19
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2 ?
Unique (%)< 0.1%

Sample

1st rowUS
2nd rowUS
3rd rowUS
4th rowUS
5th rowUS
ValueCountFrequency (%)
us 1926387
> 99.9%
hemionchos 1
 
< 0.1%
striatus 1
 
< 0.1%
conspicuum 1
 
< 0.1%
icteridorum 1
 
< 0.1%
2025-01-02T18:30:04.528698image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
U 1926387
50.0%
S 1926387
50.0%
i 5
 
< 0.1%
s 4
 
< 0.1%
o 4
 
< 0.1%
u 4
 
< 0.1%
c 3
 
< 0.1%
t 3
 
< 0.1%
r 3
 
< 0.1%
m 3
 
< 0.1%
Other values (9) 12
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
(unknown) 3852815
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
U 1926387
50.0%
S 1926387
50.0%
i 5
 
< 0.1%
s 4
 
< 0.1%
o 4
 
< 0.1%
u 4
 
< 0.1%
c 3
 
< 0.1%
t 3
 
< 0.1%
r 3
 
< 0.1%
m 3
 
< 0.1%
Other values (9) 12
 
< 0.1%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 3852815
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
U 1926387
50.0%
S 1926387
50.0%
i 5
 
< 0.1%
s 4
 
< 0.1%
o 4
 
< 0.1%
u 4
 
< 0.1%
c 3
 
< 0.1%
t 3
 
< 0.1%
r 3
 
< 0.1%
m 3
 
< 0.1%
Other values (9) 12
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 3852815
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
U 1926387
50.0%
S 1926387
50.0%
i 5
 
< 0.1%
s 4
 
< 0.1%
o 4
 
< 0.1%
u 4
 
< 0.1%
c 3
 
< 0.1%
t 3
 
< 0.1%
r 3
 
< 0.1%
m 3
 
< 0.1%
Other values (9) 12
 
< 0.1%
Distinct209948
Distinct (%)10.9%
Missing2
Missing (%)< 0.1%
Memory size14.7 MiB
2025-01-02T18:30:04.703276image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length24
Median length24
Mean length23.99591152
Min length20

Characters and Unicode

Total characters46225412
Distinct characters15
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9123 ?
Unique (%)0.5%

Sample

1st row2024-12-02T13:57:44.311Z
2nd row2024-12-02T13:57:20.485Z
3rd row2024-12-02T13:57:18.447Z
4th row2024-12-02T13:57:45.124Z
5th row2024-12-02T13:57:20.489Z
ValueCountFrequency (%)
2024-12-02t13:57:28.783z 37
 
< 0.1%
2024-12-02t13:57:52.889z 37
 
< 0.1%
2024-12-02t13:57:43.700z 36
 
< 0.1%
2024-12-02t13:57:40.815z 36
 
< 0.1%
2024-12-02t13:58:01.714z 36
 
< 0.1%
2024-12-02t13:57:50.671z 35
 
< 0.1%
2024-12-02t13:57:53.093z 35
 
< 0.1%
2024-12-02t13:57:40.927z 35
 
< 0.1%
2024-12-02t13:57:28.440z 35
 
< 0.1%
2024-12-02t13:57:33.269z 35
 
< 0.1%
Other values (209938) 1926030
> 99.9%
2025-01-02T18:30:04.927429image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 8796773
19.0%
0 4884695
10.6%
1 4858658
10.5%
- 3852774
8.3%
: 3852774
8.3%
4 3098095
 
6.7%
5 3058765
 
6.6%
3 3051121
 
6.6%
T 1926387
 
4.2%
Z 1926387
 
4.2%
Other values (5) 6918983
15.0%

Most occurring categories

ValueCountFrequency (%)
(unknown) 46225412
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
2 8796773
19.0%
0 4884695
10.6%
1 4858658
10.5%
- 3852774
8.3%
: 3852774
8.3%
4 3098095
 
6.7%
5 3058765
 
6.6%
3 3051121
 
6.6%
T 1926387
 
4.2%
Z 1926387
 
4.2%
Other values (5) 6918983
15.0%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 46225412
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
2 8796773
19.0%
0 4884695
10.6%
1 4858658
10.5%
- 3852774
8.3%
: 3852774
8.3%
4 3098095
 
6.7%
5 3058765
 
6.6%
3 3051121
 
6.6%
T 1926387
 
4.2%
Z 1926387
 
4.2%
Other values (5) 6918983
15.0%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 46225412
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
2 8796773
19.0%
0 4884695
10.6%
1 4858658
10.5%
- 3852774
8.3%
: 3852774
8.3%
4 3098095
 
6.7%
5 3058765
 
6.6%
3 3051121
 
6.6%
T 1926387
 
4.2%
Z 1926387
 
4.2%
Other values (5) 6918983
15.0%

elevation
Unsupported

Missing  Rejected  Unsupported 

Missing1919566
Missing (%)99.6%
Memory size14.7 MiB

elevationAccuracy
Unsupported

Missing  Rejected  Unsupported 

Missing1922884
Missing (%)99.8%
Memory size14.7 MiB

depth
Unsupported

Missing  Rejected  Unsupported 

Missing1143678
Missing (%)59.4%
Memory size14.7 MiB

depthAccuracy
Unsupported

Missing  Rejected  Unsupported 

Missing1205336
Missing (%)62.6%
Memory size14.7 MiB

distanceFromCentroidInMeters
Real number (ℝ)

Missing 

Distinct602
Distinct (%)6.8%
Missing1917542
Missing (%)99.5%
Infinite0
Infinite (%)0.0%
Mean1283.567939
Minimum0
Maximum4987.877694
Zeros2777
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size14.7 MiB
2025-01-02T18:30:04.998584image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median511.1528955
Q31986.183866
95-th percentile4262.642359
Maximum4987.877694
Range4987.877694
Interquartile range (IQR)1986.183866

Descriptive statistics

Standard deviation1434.673942
Coefficient of variation (CV)1.117723416
Kurtosis-0.3400112764
Mean1283.567939
Median Absolute Deviation (MAD)511.1528955
Skewness0.935416819
Sum11355725.56
Variance2058289.319
MonotonicityNot monotonic
2025-01-02T18:30:05.069761image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 2777
 
0.1%
511.1528955 887
 
< 0.1%
365.9456783 341
 
< 0.1%
1436.265125 162
 
< 0.1%
3843.282665 125
 
< 0.1%
3.650579246 104
 
< 0.1%
1878.902046 83
 
< 0.1%
1726.525481 80
 
< 0.1%
857.2535536 75
 
< 0.1%
1809.590416 71
 
< 0.1%
Other values (592) 4142
 
0.2%
(Missing) 1917542
99.5%
ValueCountFrequency (%)
0 2777
0.1%
7.810965062 × 10-101
 
< 0.1%
3.317440985 2
 
< 0.1%
3.591589964 3
 
< 0.1%
3.650579246 104
 
< 0.1%
ValueCountFrequency (%)
4987.877694 4
< 0.1%
4980.294784 1
 
< 0.1%
4973.555741 1
 
< 0.1%
4969.168143 2
< 0.1%
4964.795521 1
 
< 0.1%

issue
Text

Distinct401
Distinct (%)< 0.1%
Missing34
Missing (%)< 0.1%
Memory size14.7 MiB
2025-01-02T18:30:05.172142image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length209
Median length204
Mean length89.03875817
Min length16

Characters and Unicode

Total characters171520257
Distinct characters28
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique85 ?
Unique (%)< 0.1%

Sample

1st rowOCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT;GEODETIC_DATUM_ASSUMED_WGS84;CONTINENT_INVALID
2nd rowOCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT
3rd rowOCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT;CONTINENT_DERIVED_FROM_COUNTRY;CONTINENT_INVALID
4th rowOCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT;GEODETIC_DATUM_ASSUMED_WGS84;CONTINENT_INVALID
5th rowOCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT;CONTINENT_DERIVED_FROM_COUNTRY
ValueCountFrequency (%)
occurrence_status_inferred_from_individual_count;geodetic_datum_assumed_wgs84;continent_invalid 516478
26.8%
occurrence_status_inferred_from_individual_count 418366
21.7%
occurrence_status_inferred_from_individual_count;geodetic_datum_assumed_wgs84;continent_derived_from_coordinates;continent_invalid 224592
11.7%
occurrence_status_inferred_from_individual_count;continent_derived_from_country;continent_invalid 212163
11.0%
occurrence_status_inferred_from_individual_count;continent_derived_from_country 195778
 
10.2%
occurrence_status_inferred_from_individual_count;geodetic_datum_assumed_wgs84;continent_derived_from_coordinates 50454
 
2.6%
occurrence_status_inferred_from_individual_count;taxon_match_higherrank 36575
 
1.9%
occurrence_status_inferred_from_individual_count;continent_derived_from_coordinates 32128
 
1.7%
occurrence_status_inferred_from_individual_count;country_derived_from_coordinates;geodetic_datum_assumed_wgs84;continent_invalid 27721
 
1.4%
occurrence_status_inferred_from_individual_count;continent_derived_from_country;taxon_match_higherrank 25845
 
1.3%
Other values (391) 186255
 
9.7%
2025-01-02T18:30:05.353303image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
_ 16496084
9.6%
N 15722592
 
9.2%
E 14675877
 
8.6%
I 14123173
 
8.2%
T 12772362
 
7.4%
R 12496875
 
7.3%
D 11817130
 
6.9%
C 11680988
 
6.8%
O 10988569
 
6.4%
U 10155291
 
5.9%
Other values (18) 40591316
23.7%

Most occurring categories

ValueCountFrequency (%)
(unknown) 171520257
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
_ 16496084
9.6%
N 15722592
 
9.2%
E 14675877
 
8.6%
I 14123173
 
8.2%
T 12772362
 
7.4%
R 12496875
 
7.3%
D 11817130
 
6.9%
C 11680988
 
6.8%
O 10988569
 
6.4%
U 10155291
 
5.9%
Other values (18) 40591316
23.7%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 171520257
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
_ 16496084
9.6%
N 15722592
 
9.2%
E 14675877
 
8.6%
I 14123173
 
8.2%
T 12772362
 
7.4%
R 12496875
 
7.3%
D 11817130
 
6.9%
C 11680988
 
6.8%
O 10988569
 
6.4%
U 10155291
 
5.9%
Other values (18) 40591316
23.7%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 171520257
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
_ 16496084
9.6%
N 15722592
 
9.2%
E 14675877
 
8.6%
I 14123173
 
8.2%
T 12772362
 
7.4%
R 12496875
 
7.3%
D 11817130
 
6.9%
C 11680988
 
6.8%
O 10988569
 
6.4%
U 10155291
 
5.9%
Other values (18) 40591316
23.7%

mediaType
Text

Missing 

Distinct73
Distinct (%)< 0.1%
Missing1683237
Missing (%)87.4%
Memory size14.7 MiB
2025-01-02T18:30:05.419958image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length1704
Median length10
Mean length13.26034744
Min length5

Characters and Unicode

Total characters3224280
Distinct characters12
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique17 ?
Unique (%)< 0.1%

Sample

1st rowStillImage
2nd rowStillImage
3rd rowStillImage
4th rowStillImage
5th rowStillImage
ValueCountFrequency (%)
stillimage 220054
90.5%
stillimage;stillimage 12696
 
5.2%
stillimage;stillimage;stillimage 3561
 
1.5%
stillimage;stillimage;stillimage;stillimage 2030
 
0.8%
stillimage;stillimage;stillimage;stillimage;stillimage 1055
 
0.4%
stillimage;stillimage;stillimage;stillimage;stillimage;stillimage 769
 
0.3%
stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage 533
 
0.2%
stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage 390
 
0.2%
stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage 309
 
0.1%
stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage 213
 
0.1%
Other values (63) 1542
 
0.6%
2025-01-02T18:30:05.564635image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
l 630442
19.6%
a 315222
9.8%
e 315222
9.8%
S 315220
9.8%
i 315220
9.8%
t 315220
9.8%
m 315220
9.8%
I 315220
9.8%
g 315220
9.8%
; 72070
 
2.2%
Other values (2) 4
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
(unknown) 3224280
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
l 630442
19.6%
a 315222
9.8%
e 315222
9.8%
S 315220
9.8%
i 315220
9.8%
t 315220
9.8%
m 315220
9.8%
I 315220
9.8%
g 315220
9.8%
; 72070
 
2.2%
Other values (2) 4
 
< 0.1%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 3224280
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
l 630442
19.6%
a 315222
9.8%
e 315222
9.8%
S 315220
9.8%
i 315220
9.8%
t 315220
9.8%
m 315220
9.8%
I 315220
9.8%
g 315220
9.8%
; 72070
 
2.2%
Other values (2) 4
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 3224280
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
l 630442
19.6%
a 315222
9.8%
e 315222
9.8%
S 315220
9.8%
i 315220
9.8%
t 315220
9.8%
m 315220
9.8%
I 315220
9.8%
g 315220
9.8%
; 72070
 
2.2%
Other values (2) 4
 
< 0.1%

hasCoordinate
Unsupported

Rejected  Unsupported 

Missing0
Missing (%)0.0%
Memory size14.7 MiB

hasGeospatialIssues
Unsupported

Rejected  Unsupported 

Missing0
Missing (%)0.0%
Memory size14.7 MiB

taxonKey
Unsupported

Rejected  Unsupported 

Missing1
Missing (%)< 0.1%
Memory size14.7 MiB

acceptedTaxonKey
Unsupported

Rejected  Unsupported 

Missing2066
Missing (%)0.1%
Memory size14.7 MiB

kingdomKey
Unsupported

Rejected  Unsupported 

Missing1
Missing (%)< 0.1%
Memory size14.7 MiB

phylumKey
Unsupported

Rejected  Unsupported 

Missing3157
Missing (%)0.2%
Memory size14.7 MiB

classKey
Unsupported

Missing  Rejected  Unsupported 

Missing66154
Missing (%)3.4%
Memory size14.7 MiB

orderKey
Unsupported

Missing  Rejected  Unsupported 

Missing329532
Missing (%)17.1%
Memory size14.7 MiB

familyKey
Real number (ℝ)

Missing 

Distinct3524
Distinct (%)0.2%
Missing144484
Missing (%)7.5%
Infinite0
Infinite (%)0.0%
Mean734146.9195
Minimum1889
Maximum12366500
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size14.7 MiB
2025-01-02T18:30:05.629647image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Quantile statistics

Minimum1889
5-th percentile2622
Q13476
median6445
Q38120
95-th percentile5842517
Maximum12366500
Range12364611
Interquartile range (IQR)4644

Descriptive statistics

Standard deviation2032818.722
Coefficient of variation (CV)2.768953554
Kurtosis9.852825393
Mean734146.9195
Median Absolute Deviation (MAD)2583
Skewness3.131604259
Sum1.308180067 × 1012
Variance4.132351957 × 1012
MonotonicityNot monotonic
2025-01-02T18:30:05.699947image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4479 28956
 
1.5%
6779 28425
 
1.5%
3461 26787
 
1.4%
2304120 22783
 
1.2%
3445 18640
 
1.0%
2675 16831
 
0.9%
6760 16777
 
0.9%
3595 15856
 
0.8%
3588 14115
 
0.7%
3472 12961
 
0.7%
Other values (3514) 1579774
82.0%
(Missing) 144484
 
7.5%
ValueCountFrequency (%)
1889 2
 
< 0.1%
1895 19
< 0.1%
1897 17
< 0.1%
1904 9
< 0.1%
1905 21
< 0.1%
ValueCountFrequency (%)
12366500 38
< 0.1%
12265744 4
 
< 0.1%
12262968 34
< 0.1%
12256615 4
 
< 0.1%
12252285 10
 
< 0.1%

genusKey
Real number (ℝ)

Missing 

Distinct20899
Distinct (%)1.3%
Missing358040
Missing (%)18.6%
Infinite0
Infinite (%)0.0%
Mean3388202.448
Minimum1000426
Maximum12385823
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size14.7 MiB
2025-01-02T18:30:05.767658image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Quantile statistics

Minimum1000426
5-th percentile2201601
Q12270059
median2300882
Q34329192
95-th percentile9090179
Maximum12385823
Range11385397
Interquartile range (IQR)2059133

Descriptive statistics

Standard deviation2154038.958
Coefficient of variation (CV)0.6357468276
Kurtosis3.248211713
Mean3388202.448
Median Absolute Deviation (MAD)73565
Skewness2.01671791
Sum5.313883922 × 1012
Variance4.639883831 × 1012
MonotonicityNot monotonic
2025-01-02T18:30:05.908348image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
9819702 22884
 
1.2%
8179898 8956
 
0.5%
2227317 8948
 
0.5%
4646327 8189
 
0.4%
2227127 8096
 
0.4%
2318625 5223
 
0.3%
5189970 4536
 
0.2%
2302962 4534
 
0.2%
2224189 4234
 
0.2%
2301998 4085
 
0.2%
Other values (20889) 1488664
77.3%
(Missing) 358040
 
18.6%
ValueCountFrequency (%)
1000426 24
< 0.1%
1000452 2
 
< 0.1%
1000456 17
< 0.1%
1000486 1
 
< 0.1%
1000491 10
< 0.1%
ValueCountFrequency (%)
12385823 51
 
< 0.1%
12373983 1
 
< 0.1%
12364957 16
 
< 0.1%
12350721 196
< 0.1%
12349836 3
 
< 0.1%

subgenusKey
Text

Constant  Missing 

Distinct1
Distinct (%)50.0%
Missing1926387
Missing (%)> 99.9%
Memory size14.7 MiB
2025-01-02T18:30:05.945566image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length2
Median length2
Mean length2
Min length2

Characters and Unicode

Total characters4
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNE
2nd rowNE
ValueCountFrequency (%)
ne 2
100.0%
2025-01-02T18:30:06.029609image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
N 2
50.0%
E 2
50.0%

Most occurring categories

ValueCountFrequency (%)
(unknown) 4
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
N 2
50.0%
E 2
50.0%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 4
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
N 2
50.0%
E 2
50.0%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 4
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
N 2
50.0%
E 2
50.0%

speciesKey
Real number (ℝ)

Missing 

Distinct81479
Distinct (%)6.3%
Missing626818
Missing (%)32.5%
Infinite0
Infinite (%)0.0%
Mean4546707.336
Minimum1000431
Maximum12379492
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size14.7 MiB
2025-01-02T18:30:06.087270image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Quantile statistics

Minimum1000431
5-th percentile2214680
Q12286258
median4362204
Q35856710
95-th percentile9783587
Maximum12379492
Range11379061
Interquartile range (IQR)3570452

Descriptive statistics

Standard deviation2602320.923
Coefficient of variation (CV)0.5723528546
Kurtosis-0.07264411784
Mean4546707.336
Median Absolute Deviation (MAD)2072289
Skewness0.899847159
Sum5.908768999 × 1012
Variance6.772074185 × 1012
MonotonicityNot monotonic
2025-01-02T18:30:06.154838image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2318104 2020
 
0.1%
5728138 1518
 
0.1%
7823183 1512
 
0.1%
2321421 1479
 
0.1%
9029731 1415
 
0.1%
2227405 1414
 
0.1%
2227381 1402
 
0.1%
5724968 1368
 
0.1%
2509463 1354
 
0.1%
8971201 1324
 
0.1%
Other values (81469) 1284765
66.7%
(Missing) 626818
32.5%
ValueCountFrequency (%)
1000431 7
< 0.1%
1000432 4
< 0.1%
1000443 3
< 0.1%
1000447 2
 
< 0.1%
1000454 2
 
< 0.1%
ValueCountFrequency (%)
12379492 84
< 0.1%
12353298 5
 
< 0.1%
12345691 5
 
< 0.1%
12323041 1
 
< 0.1%
12312894 12
 
< 0.1%

species
Text

Missing 

Distinct81449
Distinct (%)6.3%
Missing626818
Missing (%)32.5%
Memory size14.7 MiB
2025-01-02T18:30:06.316314image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length41
Median length36
Mean length18.98173243
Min length7

Characters and Unicode

Total characters24668109
Distinct characters54
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique23438 ?
Unique (%)1.8%

Sample

1st rowBulla striata
2nd rowStylopathes columnaris
3rd rowOphiothrix suensonii
4th rowNaria labrolineata
5th rowLysasterias heteractis
ValueCountFrequency (%)
conus 21648
 
0.8%
cerithium 8891
 
0.3%
cambarus 8740
 
0.3%
faxonius 8187
 
0.3%
procambarus 8031
 
0.3%
gracilis 6079
 
0.2%
aricidea 4891
 
0.2%
nassarius 4086
 
0.2%
pagurus 3943
 
0.2%
oliva 3823
 
0.1%
Other values (55326) 2520823
97.0%
2025-01-02T18:30:06.560552image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
a 3034205
12.3%
i 2321115
 
9.4%
s 1756467
 
7.1%
e 1634157
 
6.6%
r 1566246
 
6.3%
o 1518962
 
6.2%
l 1442638
 
5.8%
t 1302012
 
5.3%
1299571
 
5.3%
u 1297622
 
5.3%
Other values (44) 7495114
30.4%

Most occurring categories

ValueCountFrequency (%)
(unknown) 24668109
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
a 3034205
12.3%
i 2321115
 
9.4%
s 1756467
 
7.1%
e 1634157
 
6.6%
r 1566246
 
6.3%
o 1518962
 
6.2%
l 1442638
 
5.8%
t 1302012
 
5.3%
1299571
 
5.3%
u 1297622
 
5.3%
Other values (44) 7495114
30.4%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 24668109
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
a 3034205
12.3%
i 2321115
 
9.4%
s 1756467
 
7.1%
e 1634157
 
6.6%
r 1566246
 
6.3%
o 1518962
 
6.2%
l 1442638
 
5.8%
t 1302012
 
5.3%
1299571
 
5.3%
u 1297622
 
5.3%
Other values (44) 7495114
30.4%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 24668109
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
a 3034205
12.3%
i 2321115
 
9.4%
s 1756467
 
7.1%
e 1634157
 
6.6%
r 1566246
 
6.3%
o 1518962
 
6.2%
l 1442638
 
5.8%
t 1302012
 
5.3%
1299571
 
5.3%
u 1297622
 
5.3%
Other values (44) 7495114
30.4%
Distinct94524
Distinct (%)4.9%
Missing2067
Missing (%)0.1%
Memory size14.7 MiB
2025-01-02T18:30:06.767205image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length188
Median length120
Mean length29.47402098
Min length6

Characters and Unicode

Total characters56717507
Distinct characters115
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique27025 ?
Unique (%)1.4%

Sample

1st rowSycon Risso, 1827
2nd rowBulla striata Bruguière, 1792
3rd rowStylopathes columnaris (Duchassaing, 1870)
4th rowOphiothrix suensonii Lütken, 1856
5th rowNaria labrolineata (Gaskoin, 1849)
ValueCountFrequency (%)
137132
 
2.0%
linnaeus 102227
 
1.5%
1758 86436
 
1.3%
say 52030
 
0.8%
lamarck 41218
 
0.6%
dall 26280
 
0.4%
1791 25378
 
0.4%
gmelin 24581
 
0.4%
gastropoda 23786
 
0.4%
conus 22951
 
0.3%
Other values (67807) 6231681
92.0%
2025-01-02T18:30:07.029281image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
a 4975488
 
8.8%
4849378
 
8.6%
i 3756794
 
6.6%
e 3416702
 
6.0%
r 2838441
 
5.0%
s 2680068
 
4.7%
o 2507216
 
4.4%
l 2493287
 
4.4%
n 2462015
 
4.3%
t 1946945
 
3.4%
Other values (105) 24791173
43.7%

Most occurring categories

ValueCountFrequency (%)
(unknown) 56717507
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
a 4975488
 
8.8%
4849378
 
8.6%
i 3756794
 
6.6%
e 3416702
 
6.0%
r 2838441
 
5.0%
s 2680068
 
4.7%
o 2507216
 
4.4%
l 2493287
 
4.4%
n 2462015
 
4.3%
t 1946945
 
3.4%
Other values (105) 24791173
43.7%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 56717507
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
a 4975488
 
8.8%
4849378
 
8.6%
i 3756794
 
6.6%
e 3416702
 
6.0%
r 2838441
 
5.0%
s 2680068
 
4.7%
o 2507216
 
4.4%
l 2493287
 
4.4%
n 2462015
 
4.3%
t 1946945
 
3.4%
Other values (105) 24791173
43.7%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 56717507
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
a 4975488
 
8.8%
4849378
 
8.6%
i 3756794
 
6.6%
e 3416702
 
6.0%
r 2838441
 
5.0%
s 2680068
 
4.7%
o 2507216
 
4.4%
l 2493287
 
4.4%
n 2462015
 
4.3%
t 1946945
 
3.4%
Other values (105) 24791173
43.7%
Distinct133993
Distinct (%)8.5%
Missing353771
Missing (%)18.4%
Memory size14.7 MiB
2025-01-02T18:30:07.166264image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length85
Median length59
Mean length19.44688666
Min length4

Characters and Unicode

Total characters30582524
Distinct characters78
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique51619 ?
Unique (%)3.3%

Sample

1st rowScypha sp.
2nd rowBulla striata
3rd rowStylopathes columnaris
4th rowOphiothrix suensonii
5th rowCypraea labrolineata
ValueCountFrequency (%)
sp 198063
 
6.0%
conus 24328
 
0.7%
cypraea 15395
 
0.5%
cambarus 12003
 
0.4%
cerithium 9397
 
0.3%
orconectes 8683
 
0.3%
procambarus 8141
 
0.2%
nassarius 6728
 
0.2%
gracilis 6632
 
0.2%
terebra 5168
 
0.2%
Other values (70829) 3025211
91.1%
2025-01-02T18:30:07.386482image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
a 3610431
 
11.8%
i 2750408
 
9.0%
s 2277504
 
7.4%
e 1954344
 
6.4%
r 1901340
 
6.2%
o 1840596
 
6.0%
1747131
 
5.7%
l 1714269
 
5.6%
n 1541700
 
5.0%
t 1537214
 
5.0%
Other values (68) 9707587
31.7%

Most occurring categories

ValueCountFrequency (%)
(unknown) 30582524
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
a 3610431
 
11.8%
i 2750408
 
9.0%
s 2277504
 
7.4%
e 1954344
 
6.4%
r 1901340
 
6.2%
o 1840596
 
6.0%
1747131
 
5.7%
l 1714269
 
5.6%
n 1541700
 
5.0%
t 1537214
 
5.0%
Other values (68) 9707587
31.7%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 30582524
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
a 3610431
 
11.8%
i 2750408
 
9.0%
s 2277504
 
7.4%
e 1954344
 
6.4%
r 1901340
 
6.2%
o 1840596
 
6.0%
1747131
 
5.7%
l 1714269
 
5.6%
n 1541700
 
5.0%
t 1537214
 
5.0%
Other values (68) 9707587
31.7%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 30582524
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
a 3610431
 
11.8%
i 2750408
 
9.0%
s 2277504
 
7.4%
e 1954344
 
6.4%
r 1901340
 
6.2%
o 1840596
 
6.0%
1747131
 
5.7%
l 1714269
 
5.6%
n 1541700
 
5.0%
t 1537214
 
5.0%
Other values (68) 9707587
31.7%

typifiedName
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

protocol
Text

Constant 

Distinct1
Distinct (%)< 0.1%
Missing2
Missing (%)< 0.1%
Memory size14.7 MiB
2025-01-02T18:30:07.433906image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters5779161
Distinct characters3
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowEML
2nd rowEML
3rd rowEML
4th rowEML
5th rowEML
ValueCountFrequency (%)
eml 1926387
100.0%
2025-01-02T18:30:07.523470image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
E 1926387
33.3%
M 1926387
33.3%
L 1926387
33.3%

Most occurring categories

ValueCountFrequency (%)
(unknown) 5779161
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
E 1926387
33.3%
M 1926387
33.3%
L 1926387
33.3%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 5779161
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
E 1926387
33.3%
M 1926387
33.3%
L 1926387
33.3%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 5779161
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
E 1926387
33.3%
M 1926387
33.3%
L 1926387
33.3%
Distinct209948
Distinct (%)10.9%
Missing2
Missing (%)< 0.1%
Memory size14.7 MiB
2025-01-02T18:30:07.668782image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length24
Median length24
Mean length23.99591152
Min length20

Characters and Unicode

Total characters46225412
Distinct characters15
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9123 ?
Unique (%)0.5%

Sample

1st row2024-12-02T13:57:44.311Z
2nd row2024-12-02T13:57:20.485Z
3rd row2024-12-02T13:57:18.447Z
4th row2024-12-02T13:57:45.124Z
5th row2024-12-02T13:57:20.489Z
ValueCountFrequency (%)
2024-12-02t13:57:28.783z 37
 
< 0.1%
2024-12-02t13:57:52.889z 37
 
< 0.1%
2024-12-02t13:57:43.700z 36
 
< 0.1%
2024-12-02t13:57:40.815z 36
 
< 0.1%
2024-12-02t13:58:01.714z 36
 
< 0.1%
2024-12-02t13:57:50.671z 35
 
< 0.1%
2024-12-02t13:57:53.093z 35
 
< 0.1%
2024-12-02t13:57:40.927z 35
 
< 0.1%
2024-12-02t13:57:28.440z 35
 
< 0.1%
2024-12-02t13:57:33.269z 35
 
< 0.1%
Other values (209938) 1926030
> 99.9%
2025-01-02T18:30:07.898273image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 8796773
19.0%
0 4884695
10.6%
1 4858658
10.5%
- 3852774
8.3%
: 3852774
8.3%
4 3098095
 
6.7%
5 3058765
 
6.6%
3 3051121
 
6.6%
T 1926387
 
4.2%
Z 1926387
 
4.2%
Other values (5) 6918983
15.0%

Most occurring categories

ValueCountFrequency (%)
(unknown) 46225412
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
2 8796773
19.0%
0 4884695
10.6%
1 4858658
10.5%
- 3852774
8.3%
: 3852774
8.3%
4 3098095
 
6.7%
5 3058765
 
6.6%
3 3051121
 
6.6%
T 1926387
 
4.2%
Z 1926387
 
4.2%
Other values (5) 6918983
15.0%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 46225412
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
2 8796773
19.0%
0 4884695
10.6%
1 4858658
10.5%
- 3852774
8.3%
: 3852774
8.3%
4 3098095
 
6.7%
5 3058765
 
6.6%
3 3051121
 
6.6%
T 1926387
 
4.2%
Z 1926387
 
4.2%
Other values (5) 6918983
15.0%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 46225412
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
2 8796773
19.0%
0 4884695
10.6%
1 4858658
10.5%
- 3852774
8.3%
: 3852774
8.3%
4 3098095
 
6.7%
5 3058765
 
6.6%
3 3051121
 
6.6%
T 1926387
 
4.2%
Z 1926387
 
4.2%
Other values (5) 6918983
15.0%

lastCrawled
Text

Constant 

Distinct1
Distinct (%)< 0.1%
Missing2
Missing (%)< 0.1%
Memory size14.7 MiB
2025-01-02T18:30:07.978107image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length24
Median length24
Mean length24
Min length24

Characters and Unicode

Total characters46233288
Distinct characters12
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2024-12-02T11:48:23.416Z
2nd row2024-12-02T11:48:23.416Z
3rd row2024-12-02T11:48:23.416Z
4th row2024-12-02T11:48:23.416Z
5th row2024-12-02T11:48:23.416Z
ValueCountFrequency (%)
2024-12-02t11:48:23.416z 1926387
100.0%
2025-01-02T18:30:08.093826image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 9631935
20.8%
1 7705548
16.7%
4 5779161
12.5%
0 3852774
 
8.3%
- 3852774
 
8.3%
: 3852774
 
8.3%
T 1926387
 
4.2%
8 1926387
 
4.2%
3 1926387
 
4.2%
. 1926387
 
4.2%
Other values (2) 3852774
 
8.3%

Most occurring categories

ValueCountFrequency (%)
(unknown) 46233288
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
2 9631935
20.8%
1 7705548
16.7%
4 5779161
12.5%
0 3852774
 
8.3%
- 3852774
 
8.3%
: 3852774
 
8.3%
T 1926387
 
4.2%
8 1926387
 
4.2%
3 1926387
 
4.2%
. 1926387
 
4.2%
Other values (2) 3852774
 
8.3%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 46233288
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
2 9631935
20.8%
1 7705548
16.7%
4 5779161
12.5%
0 3852774
 
8.3%
- 3852774
 
8.3%
: 3852774
 
8.3%
T 1926387
 
4.2%
8 1926387
 
4.2%
3 1926387
 
4.2%
. 1926387
 
4.2%
Other values (2) 3852774
 
8.3%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 46233288
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
2 9631935
20.8%
1 7705548
16.7%
4 5779161
12.5%
0 3852774
 
8.3%
- 3852774
 
8.3%
: 3852774
 
8.3%
T 1926387
 
4.2%
8 1926387
 
4.2%
3 1926387
 
4.2%
. 1926387
 
4.2%
Other values (2) 3852774
 
8.3%

repatriated
Boolean

Missing 

Distinct2
Distinct (%)< 0.1%
Missing110140
Missing (%)5.7%
Memory size14.7 MiB
True
947669 
False
868580 
(Missing)
110140 
ValueCountFrequency (%)
True 947669
49.2%
False 868580
45.1%
(Missing) 110140
 
5.7%
2025-01-02T18:30:08.143459image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

relativeOrganismQuantity
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

projectId
Unsupported

Missing  Rejected  Unsupported 

Missing1926389
Missing (%)100.0%
Memory size14.7 MiB

isSequenced
Boolean

Imbalance 

Distinct2
Distinct (%)< 0.1%
Missing2
Missing (%)< 0.1%
Memory size14.7 MiB
False
1921265 
True
 
5122
(Missing)
 
2
ValueCountFrequency (%)
False 1921265
99.7%
True 5122
 
0.3%
(Missing) 2
 
< 0.1%
2025-01-02T18:30:08.186957image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

gbifRegion
Text

Missing 

Distinct7
Distinct (%)< 0.1%
Missing115674
Missing (%)6.0%
Memory size14.7 MiB
2025-01-02T18:30:08.244252image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length13
Median length13
Mean length10.88896817
Min length4

Characters and Unicode

Total characters19716818
Distinct characters16
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNORTH_AMERICA
2nd rowNORTH_AMERICA
3rd rowLATIN_AMERICA
4th rowNORTH_AMERICA
5th rowASIA
ValueCountFrequency (%)
north_america 900416
49.7%
latin_america 368762
20.4%
asia 206888
 
11.4%
oceania 167374
 
9.2%
africa 56930
 
3.1%
europe 56674
 
3.1%
antarctica 53671
 
3.0%
2025-01-02T18:30:08.363791image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
A 3930515
19.9%
R 2336869
11.9%
I 2122803
10.8%
C 1600824
8.1%
E 1549900
 
7.9%
N 1490223
 
7.6%
T 1376520
 
7.0%
M 1269178
 
6.4%
_ 1269178
 
6.4%
O 1124464
 
5.7%
Other values (6) 1646344
8.3%

Most occurring categories

ValueCountFrequency (%)
(unknown) 19716818
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
A 3930515
19.9%
R 2336869
11.9%
I 2122803
10.8%
C 1600824
8.1%
E 1549900
 
7.9%
N 1490223
 
7.6%
T 1376520
 
7.0%
M 1269178
 
6.4%
_ 1269178
 
6.4%
O 1124464
 
5.7%
Other values (6) 1646344
8.3%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 19716818
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
A 3930515
19.9%
R 2336869
11.9%
I 2122803
10.8%
C 1600824
8.1%
E 1549900
 
7.9%
N 1490223
 
7.6%
T 1376520
 
7.0%
M 1269178
 
6.4%
_ 1269178
 
6.4%
O 1124464
 
5.7%
Other values (6) 1646344
8.3%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 19716818
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
A 3930515
19.9%
R 2336869
11.9%
I 2122803
10.8%
C 1600824
8.1%
E 1549900
 
7.9%
N 1490223
 
7.6%
T 1376520
 
7.0%
M 1269178
 
6.4%
_ 1269178
 
6.4%
O 1124464
 
5.7%
Other values (6) 1646344
8.3%

publishedByGbifRegion
Text

Constant 

Distinct1
Distinct (%)< 0.1%
Missing2
Missing (%)< 0.1%
Memory size14.7 MiB
2025-01-02T18:30:08.420470image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length13
Median length13
Mean length13
Min length13

Characters and Unicode

Total characters25043031
Distinct characters11
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNORTH_AMERICA
2nd rowNORTH_AMERICA
3rd rowNORTH_AMERICA
4th rowNORTH_AMERICA
5th rowNORTH_AMERICA
ValueCountFrequency (%)
north_america 1926387
100.0%
2025-01-02T18:30:08.530819image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
R 3852774
15.4%
A 3852774
15.4%
N 1926387
7.7%
O 1926387
7.7%
T 1926387
7.7%
H 1926387
7.7%
_ 1926387
7.7%
M 1926387
7.7%
E 1926387
7.7%
I 1926387
7.7%

Most occurring categories

ValueCountFrequency (%)
(unknown) 25043031
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
R 3852774
15.4%
A 3852774
15.4%
N 1926387
7.7%
O 1926387
7.7%
T 1926387
7.7%
H 1926387
7.7%
_ 1926387
7.7%
M 1926387
7.7%
E 1926387
7.7%
I 1926387
7.7%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 25043031
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
R 3852774
15.4%
A 3852774
15.4%
N 1926387
7.7%
O 1926387
7.7%
T 1926387
7.7%
H 1926387
7.7%
_ 1926387
7.7%
M 1926387
7.7%
E 1926387
7.7%
I 1926387
7.7%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 25043031
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
R 3852774
15.4%
A 3852774
15.4%
N 1926387
7.7%
O 1926387
7.7%
T 1926387
7.7%
H 1926387
7.7%
_ 1926387
7.7%
M 1926387
7.7%
E 1926387
7.7%
I 1926387
7.7%

level0Gid
Text

Missing 

Distinct226
Distinct (%)0.1%
Missing1691066
Missing (%)87.8%
Memory size14.7 MiB
2025-01-02T18:30:08.686625image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters705969
Distinct characters28
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique18 ?
Unique (%)< 0.1%

Sample

1st rowUSA
2nd rowPAN
3rd rowUSA
4th rowUSA
5th rowPAN
ValueCountFrequency (%)
usa 138756
59.0%
pan 11701
 
5.0%
jpn 8794
 
3.7%
mex 4690
 
2.0%
phl 4467
 
1.9%
can 4382
 
1.9%
dom 3446
 
1.5%
cri 3146
 
1.3%
mdg 2984
 
1.3%
pri 2846
 
1.2%
Other values (216) 50111
 
21.3%
2025-01-02T18:30:08.905335image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
A 169536
24.0%
U 149988
21.2%
S 147144
20.8%
N 36249
 
5.1%
P 32498
 
4.6%
M 17314
 
2.5%
C 16563
 
2.3%
R 16511
 
2.3%
I 11617
 
1.6%
J 11408
 
1.6%
Other values (18) 97141
13.8%

Most occurring categories

ValueCountFrequency (%)
(unknown) 705969
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
A 169536
24.0%
U 149988
21.2%
S 147144
20.8%
N 36249
 
5.1%
P 32498
 
4.6%
M 17314
 
2.5%
C 16563
 
2.3%
R 16511
 
2.3%
I 11617
 
1.6%
J 11408
 
1.6%
Other values (18) 97141
13.8%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 705969
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
A 169536
24.0%
U 149988
21.2%
S 147144
20.8%
N 36249
 
5.1%
P 32498
 
4.6%
M 17314
 
2.5%
C 16563
 
2.3%
R 16511
 
2.3%
I 11617
 
1.6%
J 11408
 
1.6%
Other values (18) 97141
13.8%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 705969
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
A 169536
24.0%
U 149988
21.2%
S 147144
20.8%
N 36249
 
5.1%
P 32498
 
4.6%
M 17314
 
2.5%
C 16563
 
2.3%
R 16511
 
2.3%
I 11617
 
1.6%
J 11408
 
1.6%
Other values (18) 97141
13.8%

level0Name
Text

Missing 

Distinct226
Distinct (%)0.1%
Missing1691066
Missing (%)87.8%
Memory size14.7 MiB
2025-01-02T18:30:09.059813image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length32
Median length13
Mean length11.1625043
Min length4

Characters and Unicode

Total characters2626794
Distinct characters62
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique18 ?
Unique (%)< 0.1%

Sample

1st rowUnited States
2nd rowPanama
3rd rowUnited States
4th rowUnited States
5th rowPanama
ValueCountFrequency (%)
united 139310
34.8%
states 138840
34.7%
panama 11701
 
2.9%
japan 8794
 
2.2%
méxico 4690
 
1.2%
philippines 4467
 
1.1%
canada 4382
 
1.1%
republic 3662
 
0.9%
dominican 3446
 
0.9%
rica 3146
 
0.8%
Other values (265) 77445
19.4%
2025-01-02T18:30:09.281574image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
t 437054
16.6%
e 318254
12.1%
a 296155
11.3%
i 217274
8.3%
n 208865
8.0%
s 170178
 
6.5%
164560
 
6.3%
d 159630
 
6.1%
S 144075
 
5.5%
U 140430
 
5.3%
Other values (52) 370319
14.1%

Most occurring categories

ValueCountFrequency (%)
(unknown) 2626794
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
t 437054
16.6%
e 318254
12.1%
a 296155
11.3%
i 217274
8.3%
n 208865
8.0%
s 170178
 
6.5%
164560
 
6.3%
d 159630
 
6.1%
S 144075
 
5.5%
U 140430
 
5.3%
Other values (52) 370319
14.1%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 2626794
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
t 437054
16.6%
e 318254
12.1%
a 296155
11.3%
i 217274
8.3%
n 208865
8.0%
s 170178
 
6.5%
164560
 
6.3%
d 159630
 
6.1%
S 144075
 
5.5%
U 140430
 
5.3%
Other values (52) 370319
14.1%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 2626794
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
t 437054
16.6%
e 318254
12.1%
a 296155
11.3%
i 217274
8.3%
n 208865
8.0%
s 170178
 
6.5%
164560
 
6.3%
d 159630
 
6.1%
S 144075
 
5.5%
U 140430
 
5.3%
Other values (52) 370319
14.1%

level1Gid
Text

Missing 

Distinct1804
Distinct (%)0.8%
Missing1694634
Missing (%)88.0%
Memory size14.7 MiB
2025-01-02T18:30:09.442652image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length8
Median length8
Mean length7.672701776
Min length6

Characters and Unicode

Total characters1778187
Distinct characters38
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique305 ?
Unique (%)0.1%

Sample

1st rowUSA.10_1
2nd rowPAN.4_1
3rd rowUSA.14_1
4th rowUSA.16_1
5th rowPAN.12_1
ValueCountFrequency (%)
usa.10_1 18116
 
7.8%
usa.5_1 8182
 
3.5%
usa.43_1 8000
 
3.5%
pan.4_1 7933
 
3.4%
jpn.32_1 6827
 
2.9%
usa.47_1 6423
 
2.8%
usa.21_1 5755
 
2.5%
usa.44_1 5753
 
2.5%
usa.11_1 5094
 
2.2%
usa.9_1 4888
 
2.1%
Other values (1794) 154784
66.8%
2025-01-02T18:30:09.669679image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 319208
18.0%
_ 231589
13.0%
. 231553
13.0%
A 166849
9.4%
U 148282
8.3%
S 146784
8.3%
2 66976
 
3.8%
4 61992
 
3.5%
3 50877
 
2.9%
N 36177
 
2.0%
Other values (28) 317900
17.9%

Most occurring categories

ValueCountFrequency (%)
(unknown) 1778187
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
1 319208
18.0%
_ 231589
13.0%
. 231553
13.0%
A 166849
9.4%
U 148282
8.3%
S 146784
8.3%
2 66976
 
3.8%
4 61992
 
3.5%
3 50877
 
2.9%
N 36177
 
2.0%
Other values (28) 317900
17.9%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 1778187
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
1 319208
18.0%
_ 231589
13.0%
. 231553
13.0%
A 166849
9.4%
U 148282
8.3%
S 146784
8.3%
2 66976
 
3.8%
4 61992
 
3.5%
3 50877
 
2.9%
N 36177
 
2.0%
Other values (28) 317900
17.9%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 1778187
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
1 319208
18.0%
_ 231589
13.0%
. 231553
13.0%
A 166849
9.4%
U 148282
8.3%
S 146784
8.3%
2 66976
 
3.8%
4 61992
 
3.5%
3 50877
 
2.9%
N 36177
 
2.0%
Other values (28) 317900
17.9%

level1Name
Text

Missing 

Distinct1735
Distinct (%)0.7%
Missing1694634
Missing (%)88.0%
Memory size14.7 MiB
2025-01-02T18:30:09.800664image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length32
Median length29
Mean length8.969407348
Min length3

Characters and Unicode

Total characters2078705
Distinct characters114
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique295 ?
Unique (%)0.1%

Sample

1st rowFlorida
2nd rowColón
3rd rowIllinois
4th rowIowa
5th rowPanamá
ValueCountFrequency (%)
florida 18120
 
6.1%
california 9283
 
3.1%
carolina 8221
 
2.8%
tennessee 8000
 
2.7%
colón 7933
 
2.7%
virginia 7606
 
2.6%
okinawa 6827
 
2.3%
new 5902
 
2.0%
maryland 5759
 
1.9%
texas 5753
 
1.9%
Other values (1874) 212773
71.8%
2025-01-02T18:30:09.999728image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
a 294071
14.1%
i 195344
 
9.4%
n 167961
 
8.1%
o 145084
 
7.0%
r 120163
 
5.8%
s 119088
 
5.7%
e 117144
 
5.6%
l 98274
 
4.7%
t 78261
 
3.8%
64422
 
3.1%
Other values (104) 678893
32.7%

Most occurring categories

ValueCountFrequency (%)
(unknown) 2078705
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
a 294071
14.1%
i 195344
 
9.4%
n 167961
 
8.1%
o 145084
 
7.0%
r 120163
 
5.8%
s 119088
 
5.7%
e 117144
 
5.6%
l 98274
 
4.7%
t 78261
 
3.8%
64422
 
3.1%
Other values (104) 678893
32.7%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 2078705
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
a 294071
14.1%
i 195344
 
9.4%
n 167961
 
8.1%
o 145084
 
7.0%
r 120163
 
5.8%
s 119088
 
5.7%
e 117144
 
5.6%
l 98274
 
4.7%
t 78261
 
3.8%
64422
 
3.1%
Other values (104) 678893
32.7%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 2078705
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
a 294071
14.1%
i 195344
 
9.4%
n 167961
 
8.1%
o 145084
 
7.0%
r 120163
 
5.8%
s 119088
 
5.7%
e 117144
 
5.6%
l 98274
 
4.7%
t 78261
 
3.8%
64422
 
3.1%
Other values (104) 678893
32.7%

level2Gid
Text

Missing 

Distinct7611
Distinct (%)3.5%
Missing1708980
Missing (%)88.7%
Memory size14.7 MiB
2025-01-02T18:30:10.143205image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length12
Median length11
Mean length10.36195374
Min length7

Characters and Unicode

Total characters2252782
Distinct characters38
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1730 ?
Unique (%)0.8%

Sample

1st rowUSA.10.59_1
2nd rowPAN.4.2_1
3rd rowUSA.14.18_1
4th rowUSA.16.3_1
5th rowPAN.12.2_1
ValueCountFrequency (%)
jpn.32.28_1 6059
 
2.8%
usa.10.43_1 6013
 
2.8%
pan.4.2_1 5746
 
2.6%
usa.9.1_1 4888
 
2.2%
usa.10.44_1 4299
 
2.0%
usa.22.1_1 3251
 
1.5%
mdg.2.1_1 2723
 
1.3%
dom.29.3_1 2676
 
1.2%
cri.5.2_1 2210
 
1.0%
pan.4.5_1 2107
 
1.0%
Other values (7601) 177437
81.6%
2025-01-02T18:30:10.443331image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
. 434450
19.3%
1 372514
16.5%
_ 217409
9.7%
A 164597
 
7.3%
U 146942
 
6.5%
S 144683
 
6.4%
2 131222
 
5.8%
4 103045
 
4.6%
3 93136
 
4.1%
5 60231
 
2.7%
Other values (28) 384553
17.1%

Most occurring categories

ValueCountFrequency (%)
(unknown) 2252782
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
. 434450
19.3%
1 372514
16.5%
_ 217409
9.7%
A 164597
 
7.3%
U 146942
 
6.5%
S 144683
 
6.4%
2 131222
 
5.8%
4 103045
 
4.6%
3 93136
 
4.1%
5 60231
 
2.7%
Other values (28) 384553
17.1%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 2252782
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
. 434450
19.3%
1 372514
16.5%
_ 217409
9.7%
A 164597
 
7.3%
U 146942
 
6.5%
S 144683
 
6.4%
2 131222
 
5.8%
4 103045
 
4.6%
3 93136
 
4.1%
5 60231
 
2.7%
Other values (28) 384553
17.1%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 2252782
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
. 434450
19.3%
1 372514
16.5%
_ 217409
9.7%
A 164597
 
7.3%
U 146942
 
6.5%
S 144683
 
6.4%
2 131222
 
5.8%
4 103045
 
4.6%
3 93136
 
4.1%
5 60231
 
2.7%
Other values (28) 384553
17.1%

level2Name
Text

Missing 

Distinct6183
Distinct (%)2.8%
Missing1709046
Missing (%)88.7%
Memory size14.7 MiB
2025-01-02T18:30:10.591356image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length32
Median length29
Mean length8.376538467
Min length1

Characters and Unicode

Total characters1820582
Distinct characters147
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1556 ?
Unique (%)0.7%

Sample

1st rowSeminole
2nd rowColón
3rd rowCumberland
4th rowAllamakee
5th rowChepo
ValueCountFrequency (%)
san 6246
 
2.3%
onna 6059
 
2.2%
miami-dade 6013
 
2.2%
colón 5755
 
2.1%
of 5128
 
1.9%
columbia 5068
 
1.8%
monroe 4935
 
1.8%
district 4903
 
1.8%
de 3904
 
1.4%
barnstable 3251
 
1.2%
Other values (6462) 224554
81.4%
2025-01-02T18:30:10.804986image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
a 214954
 
11.8%
n 155743
 
8.6%
e 142975
 
7.9%
o 137584
 
7.6%
i 116282
 
6.4%
r 99257
 
5.5%
l 83850
 
4.6%
t 80842
 
4.4%
s 71221
 
3.9%
58473
 
3.2%
Other values (137) 659401
36.2%

Most occurring categories

ValueCountFrequency (%)
(unknown) 1820582
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
a 214954
 
11.8%
n 155743
 
8.6%
e 142975
 
7.9%
o 137584
 
7.6%
i 116282
 
6.4%
r 99257
 
5.5%
l 83850
 
4.6%
t 80842
 
4.4%
s 71221
 
3.9%
58473
 
3.2%
Other values (137) 659401
36.2%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 1820582
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
a 214954
 
11.8%
n 155743
 
8.6%
e 142975
 
7.9%
o 137584
 
7.6%
i 116282
 
6.4%
r 99257
 
5.5%
l 83850
 
4.6%
t 80842
 
4.4%
s 71221
 
3.9%
58473
 
3.2%
Other values (137) 659401
36.2%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 1820582
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
a 214954
 
11.8%
n 155743
 
8.6%
e 142975
 
7.9%
o 137584
 
7.6%
i 116282
 
6.4%
r 99257
 
5.5%
l 83850
 
4.6%
t 80842
 
4.4%
s 71221
 
3.9%
58473
 
3.2%
Other values (137) 659401
36.2%

level3Gid
Text

Missing 

Distinct3019
Distinct (%)7.6%
Missing1886622
Missing (%)97.9%
Memory size14.7 MiB
2025-01-02T18:30:10.945562image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length15
Median length11
Mean length11.67430281
Min length11

Characters and Unicode

Total characters464252
Distinct characters35
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1210 ?
Unique (%)3.0%

Sample

1st rowPAN.4.2.6_1
2nd rowPAN.12.2.2_1
3rd rowMMR.4.2.6_1
4th rowPAN.12.1.4_1
5th rowCAN.9.20.18_1
ValueCountFrequency (%)
pan.4.2.4_1 3201
 
8.0%
mdg.2.1.5_1 2581
 
6.5%
pan.4.2.6_1 2281
 
5.7%
cri.5.2.1_1 2199
 
5.5%
pan.4.5.5_1 1729
 
4.3%
can.6.2.11_1 743
 
1.9%
pan.11.1.5_1 729
 
1.8%
phl.20.2.8_1 443
 
1.1%
phl.25.27.3_1 382
 
1.0%
pan.12.1.4_1 370
 
0.9%
Other values (3009) 25109
63.1%
2025-01-02T18:30:11.149716image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
. 119301
25.7%
1 77734
16.7%
_ 39767
 
8.6%
2 30156
 
6.5%
4 20559
 
4.4%
N 19748
 
4.3%
A 19069
 
4.1%
P 17786
 
3.8%
5 17379
 
3.7%
C 11012
 
2.4%
Other values (25) 91741
19.8%

Most occurring categories

ValueCountFrequency (%)
(unknown) 464252
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
. 119301
25.7%
1 77734
16.7%
_ 39767
 
8.6%
2 30156
 
6.5%
4 20559
 
4.4%
N 19748
 
4.3%
A 19069
 
4.1%
P 17786
 
3.8%
5 17379
 
3.7%
C 11012
 
2.4%
Other values (25) 91741
19.8%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 464252
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
. 119301
25.7%
1 77734
16.7%
_ 39767
 
8.6%
2 30156
 
6.5%
4 20559
 
4.4%
N 19748
 
4.3%
A 19069
 
4.1%
P 17786
 
3.8%
5 17379
 
3.7%
C 11012
 
2.4%
Other values (25) 91741
19.8%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 464252
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
. 119301
25.7%
1 77734
16.7%
_ 39767
 
8.6%
2 30156
 
6.5%
4 20559
 
4.4%
N 19748
 
4.3%
A 19069
 
4.1%
P 17786
 
3.8%
5 17379
 
3.7%
C 11012
 
2.4%
Other values (25) 91741
19.8%

level3Name
Text

Missing 

Distinct2869
Distinct (%)7.3%
Missing1887342
Missing (%)98.0%
Memory size14.7 MiB
2025-01-02T18:30:11.311320image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length32
Median length29
Mean length9.372038825
Min length2

Characters and Unicode

Total characters365950
Distinct characters125
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1138 ?
Unique (%)2.9%

Sample

1st rowCristóbal
2nd rowChepillo
3rd rowMyitkyina
4th rowPedro González
5th rowKenora, Unorganized
ValueCountFrequency (%)
cativá 3201
 
5.8%
nosibe 2581
 
4.7%
cristóbal 2281
 
4.1%
limon 2199
 
4.0%
portobelo 1729
 
3.1%
harbour 745
 
1.4%
sachs 743
 
1.3%
veracruz 729
 
1.3%
santa 615
 
1.1%
unorganized 585
 
1.1%
Other values (3190) 39688
72.0%
2025-01-02T18:30:11.540506image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
a 46333
 
12.7%
o 27885
 
7.6%
i 25003
 
6.8%
n 22399
 
6.1%
r 19868
 
5.4%
e 19376
 
5.3%
t 17849
 
4.9%
16049
 
4.4%
l 15536
 
4.2%
s 13794
 
3.8%
Other values (115) 141858
38.8%

Most occurring categories

ValueCountFrequency (%)
(unknown) 365950
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
a 46333
 
12.7%
o 27885
 
7.6%
i 25003
 
6.8%
n 22399
 
6.1%
r 19868
 
5.4%
e 19376
 
5.3%
t 17849
 
4.9%
16049
 
4.4%
l 15536
 
4.2%
s 13794
 
3.8%
Other values (115) 141858
38.8%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 365950
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
a 46333
 
12.7%
o 27885
 
7.6%
i 25003
 
6.8%
n 22399
 
6.1%
r 19868
 
5.4%
e 19376
 
5.3%
t 17849
 
4.9%
16049
 
4.4%
l 15536
 
4.2%
s 13794
 
3.8%
Other values (115) 141858
38.8%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 365950
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
a 46333
 
12.7%
o 27885
 
7.6%
i 25003
 
6.8%
n 22399
 
6.1%
r 19868
 
5.4%
e 19376
 
5.3%
t 17849
 
4.9%
16049
 
4.4%
l 15536
 
4.2%
s 13794
 
3.8%
Other values (115) 141858
38.8%

iucnRedListCategory
Text

Missing 

Distinct9
Distinct (%)< 0.1%
Missing469562
Missing (%)24.4%
Memory size14.7 MiB
2025-01-02T18:30:11.592321image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length2
Median length2
Mean length2
Min length2

Characters and Unicode

Total characters2913654
Distinct characters11
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNE
2nd rowNE
3rd rowNE
4th rowNE
5th rowNE
ValueCountFrequency (%)
ne 1307916
89.8%
lc 117121
 
8.0%
dd 11259
 
0.8%
nt 6488
 
0.4%
vu 6192
 
0.4%
cr 3404
 
0.2%
en 3150
 
0.2%
ex 1118
 
0.1%
ew 179
 
< 0.1%
2025-01-02T18:30:11.696557image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
N 1317554
45.2%
E 1312363
45.0%
C 120525
 
4.1%
L 117121
 
4.0%
D 22518
 
0.8%
T 6488
 
0.2%
V 6192
 
0.2%
U 6192
 
0.2%
R 3404
 
0.1%
X 1118
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
(unknown) 2913654
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
N 1317554
45.2%
E 1312363
45.0%
C 120525
 
4.1%
L 117121
 
4.0%
D 22518
 
0.8%
T 6488
 
0.2%
V 6192
 
0.2%
U 6192
 
0.2%
R 3404
 
0.1%
X 1118
 
< 0.1%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 2913654
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
N 1317554
45.2%
E 1312363
45.0%
C 120525
 
4.1%
L 117121
 
4.0%
D 22518
 
0.8%
T 6488
 
0.2%
V 6192
 
0.2%
U 6192
 
0.2%
R 3404
 
0.1%
X 1118
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 2913654
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
N 1317554
45.2%
E 1312363
45.0%
C 120525
 
4.1%
L 117121
 
4.0%
D 22518
 
0.8%
T 6488
 
0.2%
V 6192
 
0.2%
U 6192
 
0.2%
R 3404
 
0.1%
X 1118
 
< 0.1%